tm (version 0.5-10)

RCV1Document: RCV1 Text Document

Description

Construct an object representing a RCV1 XML text document with meta information.

Usage

RCV1Document(x, author = character(0),
             datetimestamp = as.POSIXlt(Sys.time(), tz = "GMT"),
             description = character(0), heading = character(0),
             id = character(0), origin = character(0),
             language = character(0), localmetadata = list())

Arguments

x
Object of class list containing the content.
author
Object of class character containing the author names.
datetimestamp
Object of class POSIXlt containing the date and time when the document was written.
description
Object of class character containing additional text information.
heading
Object of class character containing the title or a short heading.
id
Object of class character containing an identifier.
origin
Object of class character containing information on the source and origin of the text.
language
Object of class character containing the language of the text (preferably as IETF language tags).
localmetadata
Object of class list containing local meta data in form of tag-value pairs.

References

Lewis, D. D.; Yang, Y.; Rose, T.; and Li, F (2004). RCV1: A New Benchmark Collection for Text Categorization Research. Journal of Machine Learning Research, 5, 361--397. http://www.jmlr.org/papers/volume5/lewis04a/lewis04a.pdf

See Also

PlainTextDocument and Reuters21578Document