rJava package to connect to a JVM.
| Package: |
| mallet |
| Type: |
| Package |
| Version: |
| 1.0 |
| Date: |
| 2013-08-08 |
| License: |
| MIT |
Create a topic model trainer: MalletLDA
Load documents from disk and import them:
mallet.read.dir
mallet.import
Get info about word frequencies: mallet.word.freqs
Get trained model parameters:
mallet.doc.topics
mallet.topic.words
mallet.subset.topic.words
Reports on topic words:
mallet.top.words
mallet.topic.labels
Clustering of topics: mallet.topic.hclust
The Java toolkit: Andrew Kachites McCallum. The Mallet Toolkit. 2002.
Details of the fast sparse Gibbs sampling algorithm: Limin Yao, David Mimno, Andrew McCallum. Streaming Inference for Latent Dirichlet Allocation. KDD, 2009.
Hyperparameter optimization: Hanna Wallach, David Mimno, Andrew McCallum. Rethinking LDA: Why Priors Matter. NIPS, 2010.