textTokenizer

text

Character. Language in text (used for stop words)

lang

Character vector. Which word do you wish to exclude?

exclude

Boolean. If you wish to keep spaces in each line
to keep unique compount words, separated with spaces, set to TRUE. 
For example, 'LA ALAMEDA' will be set as 'LA_ALAMEDA' and treated as
a single word.

keep_spaces

Boolean. Return a dataframe with a one-hot-encoding kind of
results? Each word is a column and returns if word is contained.

Integer. If df = TRUE, what is the minimum frequency for
the word to be considered.

This function transforms texts into words, calculate frequencies,
supress stop words in a given language.

R library for better/faster analytics, visualization, data mining, and machine learning tasks. With a wide variety of family functions, such as Machine Learning, Data Wrangling, Exploratory, and Scrapper, lares helps the analyst or data scientist to get quick and robust results, without the need of repetitive coding or extensive programming skills.

textTokenizer: Tokenize Vectors into Words

Description

Usage

Arguments

See Also