gofastr (version 0.3.0)

filter_words: Remove Words Below a Threshold from a TermDocumentMatrix/DocumentTermMatrix

Description

Remove words from a TermDocumentMatrix or DocumentTermMatrix not meeting a rowSums/ colSums threshold.

Usage

filter_words(x, min = 1)

Arguments

min

A minimal threshold that a words row/column must sum to.

Value

Returns a TermDocumentMatrix or DocumentTermMatrix.

Examples

Run this code
# NOT RUN {
(x <-with(presidential_debates_2012, q_dtm(dialogue, paste(time, tot, sep = "_"))))
filter_words(x)
filter_words(x, 5)
(y <- with(presidential_debates_2012, q_tdm(dialogue, paste(time, tot, sep = "_"))))
filter_words(y, 6)
# }

Run the code above in your browser using DataLab