mallet.word.freqs

This method returns a data frame with one row for each unique vocabulary word,
and three columns: the word as a <code>character</code> value, the total number of
tokens of that word type, and the total number of documents that contain that
word at least once. This information can be useful in identifying candidate
stopwords.
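As a minimal sketch of how such a table might be used to shortlist stopword candidates, the example below works on a small stand-in data frame rather than a fitted model, so it runs without Java. The column names (`word`, `n_tokens`, `n_docs`), the counts, the corpus size, and the 90% threshold are all assumptions chosen for illustration; the names returned by `mallet.word.freqs` may differ, so only the column order described above is relied on.

```r
# Stand-in for the result of mallet.word.freqs(topic.model).
# All values are invented; column names are assigned here for illustration
# and are not necessarily the names the package returns.
freqs <- data.frame(
  word     = c("the", "topic", "model", "of", "dirichlet"),
  n_tokens = c(120L, 14L, 9L, 85L, 3L),   # total tokens of each word type
  n_docs   = c(10L, 6L, 4L, 10L, 2L),     # documents containing the word
  stringsAsFactors = FALSE
)

n_documents <- 10L  # hypothetical corpus size

# Share of documents in which each word appears at least once
freqs$doc_share <- freqs$n_docs / n_documents

# Words found in at least 90% of documents are treated as stopword candidates
candidates <- freqs$word[freqs$doc_share >= 0.9]
print(candidates)
```

With a real model, the same filter could be applied after renaming the three returned columns by position. The threshold is a judgment call and would normally be tuned by inspecting the top of the list.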

This package is an R interface for the Java Machine Learning for Language Toolkit (mallet)
<http://mallet.cs.umass.edu/>, used to estimate probabilistic topic models such
as Latent Dirichlet Allocation. The R package can read textual
data into mallet from R objects, run the Java implementation of mallet
directly in R, and extract results as R objects. The Mallet toolkit
has many functions; this wrapper focuses on the topic modeling sub-package
written by David Mimno. The package uses the rJava package to connect to a
JVM.


mallet

An R Wrapper for the Java Mallet Topic Modeling Toolkit

Måns Magnusson

David Mimno



Arguments

Descriptive statistics of word frequencies — mallet.word.freqs

<dl>

<dt>topic.model</dt>
<dd>A <code>cc.mallet.topics.RTopicModel</code> object created by <code>MalletLDA</code>.</dd>

</dl>

