getPDF

A character vector containing PDF file names.

myPDFs

An integer specifying the minimum number of letters per word
into the returned data.frame.

minword

An integer to specifying the maximum number of letters per
word into the returned data.frame.

maxword

An integer specifying the minimum word frequency into the
returned data.frame.

minFreqWord

A character containing an alternative path to XPDF
<code>pdftotext</code> function, see Details section.

pathToPdftotext

<code>getPDF</code> returns a word-occurrence data.frame from PDF files.
It needs <code>XPDF</code> in order to run (http://www.foolabs.com/xpdf/download.html),
and uses <code>parallel</code> to perform parallel computation.

A set of functions and a graphical user interface
to analyse and compare texts, using classical text mining
functions, as well as those from theoretical ecology.

getPDF: Extract text from PDF files and return a word-occurrence data.frame.

Description

Usage

Arguments

Value

Details

Examples