tm (version 0.7-12)

VCorpus: Volatile Corpora

Description

Create volatile corpora.

Usage

VCorpus(x, readerControl = list(reader = reader(x), language = "en"))
as.VCorpus(x)

Value

An object inheriting from VCorpus and Corpus.

Arguments

x

For VCorpus a Source object, and for as.VCorpus an R object.

readerControl

a named list of control parameters for reading in content from x.

reader

a function capable of reading in and processing the format delivered by x.

language

a character giving the language (preferably as IETF language tags, see language in package NLP). The default language is assumed to be English ("en").

Details

A volatile corpus is fully kept in memory and thus all changes only affect the corresponding R object.

See Also

Corpus for basic information on the corpus infrastructure employed by package tm.

PCorpus provides an implementation with permanent storage semantics.

Examples

Run this code
reut21578 <- system.file("texts", "crude", package = "tm")
VCorpus(DirSource(reut21578, mode = "binary"),
        list(reader = readReut21578XMLasPlain))

Run the code above in your browser using DataCamp Workspace