powered by
Report descriptive statistics for a set of texts.
text_stats(x, filter = text_filter(x))
a text corpus.
a text filter specifying the tokenization rules.
A data frame with columns named tokens, types, and sentences, with one row for each text.
tokens
types
sentences
text_stats reports descriptive statistics for a set of texts: the number of tokens, unique types, and sentences.
text_stats
text_filter, term_stats.
text_filter
term_stats
# NOT RUN { text_stats(c("A rose is a rose is a rose.", "A Rose is red. A violet is blue!")) # }
Run the code above in your browser using DataLab