qdap (version 2.2.4)

stemmer: Stem Text

Description

stemmer - Stems a vector of text strings (A wrapper for the tm package's stemDocument. stem_words - Wrapper for stemmer that stems a vector of words. stem2df - Wrapper for stemmer that stems a vector of text strings and returns a dataframe with the vector added..

Usage

stemmer(text.var, rm.bracket = TRUE, capitalize = TRUE, warn = TRUE,
  char.keep = "~~", ...)

stem_words(...)

stem2df(dataframe, text.var, stem.name = NULL, ...)

Arguments

text.var
The text variable. In stemmer this is a vector text string. For stem2df this is a character vector of length one naming the text column.
rm.bracket
logical. If TRUE brackets are removed from the text.
capitalize
logical. If TRUE selected terms are capitalized.
warn
logical. If TRUE warns about rows not ending with standard qdap punctuation endmarks.
char.keep
A character vector of symbols that should be kept within sentences.
dataframe
A dataframe object.
stem.name
A character vector of length one for the stemmed column. If NULL defaults to "stem.text".
...
Various: stemmer - Other arguments passed to capitalizer stem_words - Words or terms. stem2df - Other arguments passed to

Value

  • stemmer - returns a character vector with stemmed text. stem_words - returns a vector of individually stemmed words. stem2df - returns a dataframe with a character vector with stemmed text.

See Also

capitalizer

Examples

Run this code
#stemmer EXAMPLE:
stemmer(DATA$state)
out1 <- stemmer(raj$dialogue)
htruncdf(out1, 20, 60)

#stem_words EXAMPLE:
stem_words(doggies, jumping, swims)

#stem2df EXAMPLE:
out2 <- stem2df(DATA, "state", "new")
truncdf(out2, 30)

Run the code above in your browser using DataCamp Workspace