readPDF(...)
function
with the signature elem, language, load, id
:list
with the two named elements content
and uri
. The first element must hold the document to
be read in, the second element must hold a call to extract this
document. The call is evaluated upon a request for load on demand.character
vector giving the text's language.logical
value indicating whether the document
corpus should be immediately loaded into memory.character
vector representing a unique identification
string for the returned text document.PlainTextDocument
representing the text
and meta data in content
. Note that this PDF reader needs both the tools pdftotext
and
pdfinfo
installed and accessable on your system.
getReaders
to list available reader functions.