readDOC
From tm v0.6-2
by Ingo Feinerer
Read In a MS Word Document
Return a function which reads in a Microsoft Word document extracting its text.
Usage
readDOC(AntiwordOptions = "")
Arguments
- AntiwordOptions
- Options passed over to antiword.
Details
Formally this function is a function generator, i.e., it returns a
function (which reads in a text document) with a well-defined
signature, but can access passed over arguments (e.g., options to
antiword) via lexical scoping.
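The function-generator pattern described above can be sketched in plain R. This is a minimal illustration, not the source of readDOC: the names make_reader and reader_options are hypothetical, and the returned list merely stands in for a real document object.

```r
# Hypothetical generator that mimics the shape of readDOC(); not tm's code.
make_reader <- function(reader_options = "") {
  stopifnot(is.character(reader_options))
  # The returned function has a fixed signature (elem, language, id),
  # yet it can still see `reader_options` through lexical scoping.
  function(elem, language, id) {
    list(uri = elem$uri, language = language, options = reader_options)
  }
}

# "-w 0" is only an example option string.
reader <- make_reader(reader_options = "-w 0")
result <- reader(elem = list(uri = "report.doc"), language = "en", id = "1")
```

The options are fixed once, when the reader is generated, so every later call to reader uses the same settings without passing them again.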
Note that this MS Word reader needs the tool antiword
installed and accessible on your system. antiword can convert documents
from Microsoft Word versions 2, 6, 7, 97, 2000, 2002 and 2003 to plain
text, and is available from http://www.winfield.demon.nl/.
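Before using the reader it may help to confirm that antiword is reachable from R. The check below is a small sketch using base R only; it assumes the executable is named antiword on your platform.

```r
# Sys.which() returns an empty string when the executable is not on the PATH.
antiword_path <- Sys.which("antiword")
antiword_available <- nzchar(unname(antiword_path))

if (antiword_available) {
  message("antiword found at: ", antiword_path)
} else {
  message("antiword not found; install it or add it to the PATH.")
}
```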
Value
A function with the following formals:
- elem
- a list with the named component uri which must hold a valid file name.
- language
- a string giving the language.
- id
- Not used.
The function returns a PlainTextDocument representing the text
and metadata extracted from elem$uri.
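A possible way to call the generated reader directly is sketched below. The file name report.doc is hypothetical, and the option string "-w 0" is only an example of something antiword might accept. The call is guarded so that it runs only when tm, antiword, and the file are all present; exact behaviour may differ between tm versions.

```r
doc_path <- "report.doc"  # hypothetical Word file; replace with a real path

# Run only if the tm package, the antiword tool, and the file are available.
can_run <- requireNamespace("tm", quietly = TRUE) &&
  nzchar(unname(Sys.which("antiword"))) &&
  file.exists(doc_path)

if (can_run) {
  # Generate the reader once, then apply it to an element holding the URI.
  reader <- tm::readDOC(AntiwordOptions = "-w 0")
  doc <- reader(elem = list(uri = doc_path), language = "en", id = "report")
  print(class(doc))               # expected to include "PlainTextDocument"
  print(head(NLP::content(doc)))  # first lines of the extracted text
}
```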
See Also
Reader for basic information on the reader infrastructure
employed by package tm.