thank you, i have managed to convert the PDF into images and output a CSV, how would i go about formating this CSV. e.g. separate the spaces into cells
here my code so far
library(tesseract)
library(pdftools)
# Render pdf to png image
img_file <- pdftools::pdf_convert("filepath/test.pdf", format = 'tiff', dpi = 400)
# Extract text from png image
text <- ocr(img_file)
writeLines(text, "filepath/mydata.csv")