labelTopics(model, topics=NULL, n = 7, frexweight = 0.5)
STM
model object.
Highest Prob: are the words within each topic with the highest probability (inferred directly from topic-word distribution parameter $\beta$).
FREX: are the words that are both frequent and exclusive, identifying words that distinguish topics. This is calculated by taking the harmonic mean of rank by probability within the topic (frequency) and rank by distribution of topic given word $p(z|w=v)$ (exclusivity). In estimating exclusivity we use a James-Stein type shrinkage estimator of the distribution $p(z|w=v)$.
Score and Lift are measures provided in two other popular text mining packages. For more
information on type Score, see the R package lda
. For more
information on type Lift, see Taddy, "Multinomial Inverse Regression
for Text Analysis", Journal of the American Statistical Association 108,
2013 and the R package textir
.
stm
plot.STM