rJava
package to connect to a JVM.
Package: |
mallet |
Type: |
Package |
Version: |
1.0 |
Date: |
2013-08-08 |
License: |
MIT |
Create a topic model trainer: MalletLDA
Load documents from disk and import them:
mallet.read.dir
mallet.import
Get info about word frequencies: mallet.word.freqs
Get trained model parameters:
mallet.doc.topics
mallet.topic.words
mallet.subset.topic.words
Reports on topic words:
mallet.top.words
mallet.topic.labels
Clustering of topics: mallet.topic.hclust
The Java toolkit: Andrew Kachites McCallum. The Mallet Toolkit. 2002.
Details of the fast sparse Gibbs sampling algorithm: Limin Yao, David Mimno, Andrew McCallum. Streaming Inference for Latent Dirichlet Allocation. KDD, 2009.
Hyperparameter optimization: Hanna Wallach, David Mimno, Andrew McCallum. Rethinking LDA: Why Priors Matter. NIPS, 2010.