qdap (version 2.4.3)

stemmer: Stem Text

Description

stemmer - Stems a vector of text strings (A wrapper for the tm package's stemDocument.

stem_words - Wrapper for stemmer that stems a vector of words.

stem2df - Wrapper for stemmer that stems a vector of text strings and returns a dataframe with the vector added..

Usage

stemmer(
  text.var,
  rm.bracket = TRUE,
  capitalize = TRUE,
  warn = TRUE,
  char.keep = "~~",
  ...
)

stem_words(...)

stem2df(dataframe, text.var, stem.name = NULL, ...)

Arguments

text.var

The text variable. In stemmer this is a vector text string. For stem2df this is a character vector of length one naming the text column.

rm.bracket

logical. If TRUE brackets are removed from the text.

capitalize

logical. If TRUE selected terms are capitalized.

warn

logical. If TRUE warns about rows not ending with standard qdap punctuation endmarks.

char.keep

A character vector of symbols that should be kept within sentences.

Various: stemmer - Other arguments passed to capitalizer stem_words - Words or terms. stem2df - Other arguments passed to stemmer

dataframe

A dataframe object.

stem.name

A character vector of length one for the stemmed column. If NULL defaults to "stem.text".

Value

stemmer - returns a character vector with stemmed text.

stem_words - returns a vector of individually stemmed words.

stem2df - returns a dataframe with a character vector with stemmed text.

See Also

capitalizer

Examples

Run this code
# NOT RUN {
#stemmer EXAMPLE:
stemmer(DATA$state)
out1 <- stemmer(raj$dialogue)
htruncdf(out1, 20, 60)

#stem_words EXAMPLE:
stem_words(doggies, jumping, swims)

#stem2df EXAMPLE:
out2 <- stem2df(DATA, "state", "new")
truncdf(out2, 30)
# }

Run the code above in your browser using DataCamp Workspace