tm (version 0.6-2)

tm_filter: Filter and Index Functions on Corpora

Description

Interface to apply filter and index functions to corpora.

Usage

"tm_filter"(x, FUN, ...) "tm_filter"(x, FUN, ...) "tm_index"(x, FUN, ...) "tm_index"(x, FUN, ...)

Arguments

x
A corpus.
FUN
a filter function taking a text document as input and returning a logical value.
...
arguments to FUN.

Value

tm_filter returns a corpus containing documents where FUN matches, whereas tm_index only returns the corresponding indices.

Examples

Run this code
data("crude")
# Full-text search
tm_filter(crude, FUN = function(x) any(grep("co[m]?pany", content(x))))

Run the code above in your browser using DataLab