tm (version 0.6-1)

readDOC: Read In a MS Word Document


Return a function which reads in a Microsoft Word document extracting its text.


readDOC(AntiwordOptions = "")


Options passed over to antiword.


  • A function with the following formals: [object Object],[object Object],[object Object] The function returns a PlainTextDocument representing the text and metadata extracted from elem$uri.


Formally this function is a function generator, i.e., it returns a function (which reads in a text document) with a well-defined signature, but can access passed over arguments (e.g., options to antiword) via lexical scoping.

Note that this MS Word reader needs the tool antiword installed and accessible on your system. This can convert documents from Microsoft Word version 2, 6, 7, 97, 2000, 2002 and 2003 to plain text, and is available from

See Also

Reader for basic information on the reader infrastructure employed by package tm.