readReut21578XML

Read In a Reuters-21578 XML Document

Read in a Reuters-21578 XML document.

Usage
readReut21578XML(elem, language, id)
readReut21578XMLasPlain(elem, language, id)
Arguments
elem
a named list with a component named content, which must hold the document to be read in.
language
a string giving the language.
id
Not used.
Value

An XMLTextDocument for readReut21578XML, or a PlainTextDocument for readReut21578XMLasPlain, representing the text and metadata extracted from elem$content.

References

Lewis, David (1997) Reuters-21578 Text Categorization Collection Distribution 1.0. http://kdd.ics.uci.edu/databases/reuters21578/reuters21578.html

Luz, Saturnino. XML-encoded version of Reuters-21578. http://ronaldo.cs.tcd.ie/esslli07/data/reuters21578-xml/

See Also

Reader for basic information on the reader infrastructure employed by package tm.
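
The following is a minimal sketch of how these readers are typically used, not an official example from this help page. Rather than calling the reader directly, it is passed to a corpus constructor, which invokes it once per source element. The sketch assumes that tm is installed and that the sample Reuters-21578 files shipped with the package are available under texts/crude. The version check is also an assumption: later tm releases appear to expect DirSource(..., mode = "binary") for XML input, while the version documented here reads the files in the default text mode.

```r
library(tm)

# Directory of sample Reuters-21578 XML files shipped with tm
# (assumed to be present in the installed package).
reut21578 <- system.file("texts", "crude", package = "tm")

# Assumption: tm releases from 0.7 on read XML sources in binary mode,
# whereas 0.6-2 uses the default text mode.
src <- if (packageVersion("tm") >= "0.7") {
  DirSource(reut21578, mode = "binary")
} else {
  DirSource(reut21578)
}

# The reader is supplied through readerControl; VCorpus calls it for
# each file, filling in elem, language, and id itself.
crude <- VCorpus(src,
                 readerControl = list(reader = readReut21578XMLasPlain))

# Inspect the first document: its class, its metadata, and its text.
class(crude[[1]])
meta(crude[[1]])
content(crude[[1]])
```

Swapping in readReut21578XML as the reader should give documents that retain the XML structure instead of plain text, as described under Value.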

Aliases
  • readReut21578XML
  • readReut21578XMLasPlain
Documentation reproduced from package tm, version 0.6-2, License: GPL-3
