ft_extract_corpus

paths

which

Further arguments passed on to the <code>readerControl</code> parameter
in <code>tm::Corpus</code>


Extract text from one or more PDF documents into a tm Corpus or VCorpus.
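
A minimal usage sketch, not taken verbatim from the package. It assumes fulltext 0.1.6 and tm are installed, that "xpdf" is an accepted value for <code>which</code> with the xpdf command-line tools on the system path, that an example PDF ships under the package's examples directory, and that the result exposes the corpus as <code>res$data</code>. Adjust any of these to match your installation.

```r
# Sketch only: every package-specific detail below is an assumption
# (example file location, which = "xpdf", and the res$data element).

# Locate an example PDF; system.file() returns "" if nothing is found.
path <- system.file("examples", "example1.pdf", package = "fulltext")

if (nzchar(path) &&
    requireNamespace("fulltext", quietly = TRUE) &&
    requireNamespace("tm", quietly = TRUE)) {
  # Extract the PDF text into a tm corpus.
  res <- fulltext::ft_extract_corpus(paths = path, which = "xpdf")

  # Assumed structure: the corpus itself is stored in res$data.
  tdm <- tm::TermDocumentMatrix(res$data)
  print(tdm)
}
```

To process several documents, <code>paths</code> would be given as a character vector of file paths.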


Provides a single interface to many sources of full text
'scholarly' data, including 'Biomed Central', Public Library of
Science, 'Pubmed Central', 'eLife', 'F1000Research', 'PeerJ',
'Pensoft', 'Hindawi', 'arXiv' preprints, and more. Functionality
is included for searching for articles, downloading full or partial
text, downloading supplementary materials, and converting to various
data formats used in and outside of R.

