OCR_document: Scan PDF with optical character recognition (OCR)
Description
Extract text that is stored as images in a PDF (for example, scanned
pages) using optical character recognition (OCR) software. Two methods
are currently available: method = "nougat" and method = "tesseract".
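
For orientation, the general idea behind the "tesseract" route can be sketched with the pdftools and tesseract packages. This is only an illustration under stated assumptions, not the internals of OCR_document: the helper name ocr_pdf_sketch and its arguments (pdf_path, language, dpi) are hypothetical, and the "nougat" method is not covered.

```r
# Sketch only: approximates a tesseract-based workflow; it does not
# reproduce OCR_document() itself. Assumes the pdftools and tesseract
# packages are installed.
ocr_pdf_sketch <- function(pdf_path, language = "eng", dpi = 300) {
  stopifnot(file.exists(pdf_path))

  # Render each page to a PNG image; pdf_convert() returns the paths of
  # the files it writes (by default into the working directory).
  png_files <- pdftools::pdf_convert(pdf_path, format = "png", dpi = dpi)
  on.exit(unlink(png_files), add = TRUE)

  # Create an OCR engine for the requested language and run it on each
  # page image, yielding one character string per page.
  engine <- tesseract::tesseract(language)
  vapply(
    png_files,
    function(f) tesseract::ocr(f, engine = engine),
    character(1),
    USE.NAMES = FALSE
  )
}
```

A call such as ocr_pdf_sketch("scan.pdf") would then return a character vector with one element per page. Higher dpi values generally give the OCR engine more detail to work with, at the cost of slower rendering.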