verbose
logical; if TRUE
then use this as the default
for all functions with a verbose
argument
threads
integer; specifies the number of threads to use in
parallelized functions
print_dfm_max_ndoc
integer; specifies the number of documents
to display when using the defaults for printing a dfm
print_dfm_max_nfeat
integer; specifies the number of
features to display when using the defaults for printing a dfm
base_docname
character; stem name for documents that are
unnamed when a corpus, tokens, or dfm are created or when a dfm is converted
from another object
base_featname
character; stem name for
features that are unnamed when they are added, for whatever reason, to a dfm
through an operation that adds features
base_compname
character; stem name for components that are
created by matrix factorization
language_stemmer
character; language option for char_wordstem()
,
tokens_wordstem()
, and dfm_wordstem()
pattern_hashtag
, pattern_username
character; regex patterns for
(social media) hashtags and usernames respectively, used to avoid segmenting
these in the default internal "word" tokenizer
tokens_block_size
integer; specifies the
number of documents to be tokenized at a time in blocked tokenization.
When the number is large, tokenization becomes faster but also memory-intensive.
tokens_locale
character; specify locale in stringi boundary detection in
tokenization and corpus reshaping. See stringi::stri_opts_brkiter()
.