qdap (version 2.4.1)

Filter.all_words: Filter

Description

Filter.all_words - Filter words from an all_words object that meet max/min word length criteria.

Filter.TermDocumentMatrix - Filter words from a TermDocumentMatrix that meet max/min word length criteria.

Filter.DocumentTermMatrix - Filter words from a DocumentTermMatrix that meet max/min word length criteria.

Filter - Filter words from various objects that meet max/min word length criteria.

Filter.wfm - Filter words from a wfm that meet max/min word length criteria.

Filter.character - Filter words from a character vector that meet max/min word length criteria.

Filter.fwl - Filter words from a fwl that meet max/min word length criteria.

Filter.fswl - Filter words from a fswl that meet max/min word length criteria.

Filter.rfswl - Filter words from a rfswl that meet max/min word length criteria.

Usage

# S3 method for all_words
Filter(
  x,
  min = 1,
  max = Inf,
  count.apostrophe = TRUE,
  stopwords = NULL,
  ignore.case = TRUE,
  ...
)

# S3 method for TermDocumentMatrix
Filter(
  x,
  min = 1,
  max = Inf,
  count.apostrophe = TRUE,
  stopwords = NULL,
  ignore.case = TRUE,
  ...
)

# S3 method for DocumentTermMatrix
Filter(
  x,
  min = 1,
  max = Inf,
  count.apostrophe = TRUE,
  stopwords = NULL,
  ignore.case = TRUE,
  ...
)

Filter(
  x,
  min = 1,
  max = Inf,
  count.apostrophe = TRUE,
  stopwords = NULL,
  ignore.case = TRUE,
  ...
)

# S3 method for wfm
Filter(x, min = 1, max = Inf, count.apostrophe = TRUE, stopwords = NULL, ...)

# S3 method for character
Filter(
  x,
  min = 1,
  max = Inf,
  count.apostrophe = TRUE,
  stopwords = NULL,
  ignore.case = TRUE,
  ...
)

# S3 method for fwl
Filter(
  x,
  min = 1,
  max = Inf,
  count.apostrophe = TRUE,
  stopwords = NULL,
  ignore.case = TRUE,
  ...
)

# S3 method for fswl
Filter(
  x,
  min = 1,
  max = Inf,
  count.apostrophe = TRUE,
  stopwords = NULL,
  ignore.case = TRUE,
  ...
)

# S3 method for rfswl
Filter(
  x,
  min = 1,
  max = Inf,
  count.apostrophe = TRUE,
  stopwords = NULL,
  ignore.case = TRUE,
  ...
)

Arguments

x

A filterable object (e.g., wfm, character).

min

Minimum word length.

max

Maximum word length.

count.apostrophe

logical. If TRUE, apostrophes are counted as characters when measuring word length.

stopwords

A vector of stop words to remove.

ignore.case

logical. If TRUE, stopwords are removed regardless of case (ignored if used on a wfm).

...

Other arguments passed to specific Filter methods.
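To make the interaction of these arguments concrete, here is a minimal base R sketch of length-based word filtering on a character vector. It is not qdap's implementation: `filter_words_sketch` is a hypothetical helper, and its tokenization (split on whitespace, keep only letters and apostrophes) is an assumption, so its output may differ from what `Filter` returns.

```r
# Hypothetical helper, NOT part of qdap: keeps words whose length is in [min, max].
# Assumption: words are split on whitespace and stripped to letters/apostrophes.
filter_words_sketch <- function(x, min = 1, max = Inf, count.apostrophe = TRUE,
                                stopwords = NULL, ignore.case = TRUE) {
  vapply(x, function(s) {
    words <- strsplit(s, "\\s+")[[1]]
    words <- gsub("[^[:alpha:]']", "", words)
    words <- words[nzchar(words)]
    # Measure each word with or without its apostrophes
    measured <- if (count.apostrophe) words else gsub("'", "", words, fixed = TRUE)
    keep <- nchar(measured) >= min & nchar(measured) <= max
    # Drop stop words, optionally ignoring case
    if (!is.null(stopwords)) {
      if (ignore.case) {
        keep <- keep & !(tolower(words) %in% tolower(stopwords))
      } else {
        keep <- keep & !(words %in% stopwords)
      }
    }
    paste(words[keep], collapse = " ")
  }, character(1), USE.NAMES = FALSE)
}

x <- c("Raptors don't like robots!", "I'd pay $500.00 to rid them.")
filter_words_sketch(x, 5)                            # "don't" counts as 5 characters
filter_words_sketch(x, 5, count.apostrophe = FALSE)  # "don't" counts as 4 characters
filter_words_sketch(x, 4, stopwords = "raptors")     # case-insensitive stop word removal
```

In this sketch, `count.apostrophe` only changes how a word is measured, not how it is printed, and `ignore.case` only affects stop word matching.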

Value

Filter.all_words - Returns a matrix of the class "all_words".

Filter.TermDocumentMatrix - Returns a matrix of the class "TermDocumentMatrix".

Filter.DocumentTermMatrix - Returns a matrix of the class "DocumentTermMatrix".

Filter - Returns a matrix of the class "wfm".

Filter.character - Returns a vector of the class "character".

Filter.wfm - Returns a matrix of the class "wfm".

Filter.fwl - Returns a matrix of the class "fwl".

Filter.fswl - Returns a matrix of the class "fswl".

Filter.rfswl - Returns a matrix of the class "rfswl".

Details

all_words Method for Filter

TermDocumentMatrix Method for Filter

DocumentTermMatrix Method for Filter

character Method for Filter

fwl Method for Filter

fswl Method for Filter

rfswl Method for Filter

Examples

# NOT RUN {
Filter(with(DATA, wfm(state, list(sex, adult))), 5)
with(DATA, wfm(state, list(sex, adult)))

## Filter particular words based on max/min values in wfm
v <- with(DATA, wfm(state, list(sex, adult)))
Filter(v, 5)
Filter(v, 5, count.apostrophe = FALSE)
Filter(v, 5, 7)
Filter(v, 4, 4)
Filter(v, 3, 4)
Filter(v, 3, 4, stopwords = Top25Words)

## Filter works on character strings too...
x <- c("Raptors don't like robots!",  "I'd pay $500.00 to rid them.")
Filter(x, 3)
Filter(x, 4)
Filter(x, 4, count.apostrophe = FALSE)
Filter(x, 4, count.apostrophe = FALSE, stopwords="raptors")
Filter(x, 4, stopwords="raptors")
Filter(x, 4, stopwords="raptors", ignore.case = FALSE)

## Filter a data frame column in place...
DATA[, "state"] <- Filter(DATA[, "state"], 4)
## ...then restore the original data set
DATA <- qdap::DATA

## Filter `all_words`
head(all_words(raj$dialogue))
Filter(head(all_words(raj$dialogue)), min = 3)
# }