Parameter objects related to text analysis.
These are objects that can be used for modeling, especially in conjunction with the textrecipes package.
These objects are pre-made parameter sets that are useful in a variety of models.
max_times: frequency of word occurances for removal. See
max_tokens: the number of tokens that will be retained. See
weight: A parameter for "double normalization" when creating token counts. See
weight_scheme: the method for term frequency calculations. Possible values are: "binary", "raw count", "term frequency", "log normalization", or "double normalization". See
token: the type of token with possible values: "characters", "character_shingle", "lines", "ngrams", "paragraphs", "ptb", "regex", "sentences", "skip_ngrams", "tweets", "words", "word_stems". See
Each object is generated by either
An object of class
quant_param (inherits from
param) of length 7.