ft_extract_corpus

paths

which

further args passed on to <code>readerControl</code> parameter
in <code><a rd-options="tm" href="/link/Corpus?package=fulltext&version=0.1.8&to=tm" data-mini-rdoc="tm::Corpus">Corpus</a></code>

Extract text from one to many pdf documents into a tm Corpus or Vcorpus.

Provides a single interface to many sources of full text
'scholarly' data, including 'Biomed Central', Public Library of
Science, 'Pubmed Central', 'eLife', 'F1000Research', 'PeerJ',
'Pensoft', 'Hindawi', 'arXiv' 'preprints', and more. Functionality
included for searching for articles, downloading full or partial
text, downloading supplementary materials, converting to various
data formats used in and outside of R.

Scott Chamberlain

fulltext

Full Text of 'Scholarly' Articles Across Many Data Sources

ft_extract_corpus function

further args passed on to <code>readerControl</code> parameter
in <code><a rd-options='tm' href='Corpus'>Corpus</a></code>

ft_extract_corpus: Extract text from one to many pdf documents into a tm Corpus or Vcorpus.

Description

Usage

Arguments

Value

See Also

Examples