We need a bit more information, especially what format the papers are in, that is, pdf, html, word processor, etc, and the tye of data needed. For instance are you looking at tables, text, images or whatever?
The best idea might be to point us to an example journal document and explain what data you want to extract.