proustr (version 0.4.0)

pr_stem_words: Stem a dataframe containing a column with words

Description

Implementation of the SnowballC stemmer. Note that punctuation and capitals letters are also removed.

Usage

pr_stem_words(df, col, language = "french")

Arguments

df

the data.frame containing the sentences

col

the column with the sentences

language

the language of the words Defaut is french. See SnowballC::getStemLanguages() function for a list of supported languages.

Value

a tibble

Examples

Run this code
# NOT RUN {
a <- data.frame(words = c("matin", "heure", "fatigu<U+00E9>","sonn<U+00E9>","lois", "tests","fusionner"))
pr_stem_words(a, words)

# }

Run the code above in your browser using DataCamp Workspace