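Internal functions used in textstat_keyness. Compute keyness statistics (chi-squared, Fisher's exact test, likelihood ratio, and pointwise mutual information) for the features of a dfm.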
keyness_chi2_dt(x, correction = c("default", "yates", "williams", "none"))
keyness_chi2_stats(x)
keyness(t, f, sum_t, sum_f)
keyness_exact(x)
keyness_lr(x, correction = c("default", "yates", "williams", "none"))
keyness_pmi(x)
x
a dfm object
correction
implement the Yates correction for 2x2 tables
t
(scalar) frequency of target
f
(scalar) frequency of reference
sum_t
total of all target words
sum_f
total of all reference words
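The four scalar arguments of keyness(t, f, sum_t, sum_f) are enough to build a 2x2 contingency table for a single feature. The following is a minimal sketch of that idea, not quanteda's actual implementation: the helper name keyness_single_sketch is hypothetical, and the table layout and the sign convention (positive when the feature is over-represented in the target) are assumptions.

```r
# Hypothetical helper (not quanteda's code): signed chi-squared for one feature.
keyness_single_sketch <- function(t, f, sum_t, sum_f) {
  # Rows: the feature vs. all other words; columns: target vs. reference
  tb <- rbind(c(t, f), c(sum_t - t, sum_f - f))
  # correct = TRUE applies the Yates continuity correction to the 2x2 table
  chi <- suppressWarnings(stats::chisq.test(tb, correct = TRUE))
  # Assumed convention: positive when the feature is over-represented in the target
  direction <- if (t > chi$expected[1, 1]) 1 else -1
  data.frame(chi2 = direction * unname(chi$statistic), p = unname(chi$p.value))
}

# A feature seen 6 times in 17 target words and 2 times in 12 reference words
over <- keyness_single_sketch(t = 6, f = 2, sum_t = 17, sum_f = 12)
# Swapping target and reference flips the sign but not the magnitude
under <- keyness_single_sketch(t = 2, f = 6, sum_t = 12, sum_f = 17)
```

The warnings from chisq.test are suppressed because small expected counts are common for rare features; whether that is appropriate depends on the application.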
a data.frame of chi2 and p-values with rows named for each feature
keyness_chi2_dt
uses vectorized computation from data.table objects.
keyness_chi2_stats
uses element-by-element application of chisq.test.
keyness_exact
computes Fisher's exact test using element-by-element application of fisher.test, returning the odds ratio.
keyness_lr
computes the G2 likelihood ratio statistic.
keyness_pmi
computes the pointwise mutual information statistic using vectorized computation.
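To illustrate what "vectorized computation" means here, the sketch below computes chi-squared (without continuity correction), the G2 likelihood ratio, and pointwise mutual information for all features at once from two count vectors. It uses base R only and is not quanteda's data.table code; the function name keyness_vectorized_sketch is hypothetical, and the PMI definition used (log of observed over expected count in the target cell) is an assumption.

```r
# Hypothetical helper (not quanteda's code): per-feature statistics computed
# with vector arithmetic instead of one test call per feature.
keyness_vectorized_sketch <- function(target, reference) {
  o11 <- as.numeric(target)       # feature count in the target
  o12 <- as.numeric(reference)    # feature count in the reference
  o21 <- sum(o11) - o11           # all other target words
  o22 <- sum(o12) - o12           # all other reference words
  N <- o11 + o12 + o21 + o22

  # Expected counts under independence for each cell of the 2x2 table
  e11 <- (o11 + o12) * (o11 + o21) / N
  e12 <- (o11 + o12) * (o12 + o22) / N
  e21 <- (o21 + o22) * (o11 + o21) / N
  e22 <- (o21 + o22) * (o12 + o22) / N

  # Pearson chi-squared without continuity correction
  chi2 <- (o11 - e11)^2 / e11 + (o12 - e12)^2 / e12 +
    (o21 - e21)^2 / e21 + (o22 - e22)^2 / e22

  # G2 likelihood ratio; a zero observed cell contributes 0 to the sum
  term <- function(o, e) ifelse(o > 0, o * log(o / e), 0)
  G2 <- 2 * (term(o11, e11) + term(o12, e12) + term(o21, e21) + term(o22, e22))

  # Assumed PMI definition: log of observed over expected in the target cell
  pmi <- log(o11 / e11)

  data.frame(chi2 = chi2, G2 = G2, pmi = pmi, row.names = names(target))
}

# Counts taken from the two example documents d1 (target) and d2 (reference)
target    <- c(a = 3, b = 2, c = 6, d = 1, e = 1, f = 1, g = 1, h = 2)
reference <- c(a = 2, b = 1, c = 2, d = 4, e = 1, f = 1, g = 0, h = 1)
res <- keyness_vectorized_sketch(target, reference)

# Cross-check one feature against the element-by-element route
check <- suppressWarnings(
  stats::chisq.test(matrix(c(6, 11, 2, 10), nrow = 2), correct = FALSE)
)
```

The element-by-element route gives the same uncorrected chi-squared for a single feature, but it needs one chisq.test call per feature, which is why a vectorized version scales better on large vocabularies.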
https://en.wikipedia.org/wiki/Yates's_correction_for_continuity
http://influentialpoints.com/Training/g-likelihood_ratio_test.htm
# NOT RUN {
library(quanteda)  # dfm() is called unqualified below
mydfm <- dfm(c(d1 = "a a a b b c c c c c c d e f g h h",
               d2 = "a a b c c d d d d e f h"))
quanteda:::keyness_chi2_dt(mydfm)
quanteda:::keyness_chi2_stats(mydfm)
quanteda:::keyness_exact(mydfm)
quanteda:::keyness_lr(mydfm)
quanteda:::keyness_pmi(mydfm)
# }