tm (version 0.7-2)

tm_filter: Filter and Index Functions on Corpora

Description

Interface to apply filter and index functions to corpora.

Usage

# S3 method for PCorpus
tm_filter(x, FUN, …)
# S3 method for SimpleCorpus
tm_filter(x, FUN, …)
# S3 method for VCorpus
tm_filter(x, FUN, …)
# S3 method for PCorpus
tm_index(x, FUN, …)
# S3 method for SimpleCorpus
tm_index(x, FUN, …)
# S3 method for VCorpus
tm_index(x, FUN, …)

Arguments

x

A corpus.

FUN

a filter function taking a text document or a string (if x is a SimpleCorpus) as input and returning a logical value.

arguments to FUN.

Value

tm_filter returns a corpus containing documents where FUN matches, whereas tm_index only returns the corresponding indices.

Examples

Run this code
# NOT RUN {
data("crude")
# Full-text search
tm_filter(crude, FUN = function(x) any(grep("co[m]?pany", content(x))))
# }

Run the code above in your browser using DataCamp Workspace