Hopefully your PDFs were not generated as images. If they were, screen scraping / OCR is really the only way to get the text out. Otherwise, the text data is encoded in the PDF itself; it is just not delimited, so it can be a pain to parse.
However, R has very good text parsers! This article gives an overview. I have mostly used pdftools with readr, but the tm package looks promising too. Hopefully it helps!
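
For what it's worth, here is a rough sketch of the kind of thing I mean, not a definitive recipe. It pulls the text out with pdftools::pdf_text() (one string per page) and then splits each line into fields. The helper names split_page and read_pdf_rows, and the file name "report.pdf", are made up for illustration, and the rule "columns are separated by two or more spaces" is an assumption that will not hold for every PDF. The splitting is done in base R here; if your columns are truly fixed-width, readr::read_fwf() may be a better fit.

```r
# Hypothetical helper: split one page of extracted text into rows of fields.
# Assumes columns are separated by runs of two or more spaces.
split_page <- function(page_text) {
  lines <- strsplit(page_text, "\n", fixed = TRUE)[[1]]
  lines <- trimws(lines)          # drop leading/trailing whitespace
  lines <- lines[nzchar(lines)]   # drop blank lines
  strsplit(lines, "\\s{2,}")      # one character vector of fields per line
}

# Hypothetical helper: extract every page of a PDF and split it into rows.
# Requires the pdftools package; returns a list with one element per page.
read_pdf_rows <- function(path) {
  pages <- pdftools::pdf_text(path)  # character vector, one string per page
  lapply(pages, split_page)
}

# Usage ("report.pdf" is a placeholder path):
# rows_by_page <- read_pdf_rows("report.pdf")
# rows_by_page[[1]][[1]]   # fields of the first line on the first page
```

Rows can come back with different numbers of fields (headers, footers, wrapped cells), so expect to filter or patch them before binding everything into a data frame.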