tm (version 0.7-8)

tm_filter: Filter and Index Functions on Corpora

Description

Interface to apply filter and index functions to corpora.

Usage

# S3 method for PCorpus
tm_filter(x, FUN, ...)
# S3 method for SimpleCorpus
tm_filter(x, FUN, ...)
# S3 method for VCorpus
tm_filter(x, FUN, ...)
# S3 method for PCorpus
tm_index(x, FUN, ...)
# S3 method for SimpleCorpus
tm_index(x, FUN, ...)
# S3 method for VCorpus
tm_index(x, FUN, ...)

Value

tm_filter returns a corpus containing documents where

FUN matches, whereas tm_index only returns the corresponding indices.

Arguments

x

A corpus.

FUN

a filter function taking a text document or a string (if x is a SimpleCorpus) as input and returning the logical value TRUE or FALSE.

...

arguments to FUN.

Examples

Run this code
data("crude")
# Full-text search
tm_filter(crude, FUN = function(x) any(grep("co[m]?pany", content(x))))

Run the code above in your browser using DataLab