The probability is that of observing such extreme frequencies of the considered term in the level,
under an hypergeometric distribution based on its global frequency in the corpus and on the
number of occurrences of all terms in the document or variable level considered.
The positive or negative character of the association is visible from the sign of the t value,
or by comparing the value of the
The kind of plot to be drawn is automatically chosen from the selected measure. Row percents lead to bar plots, since the total sum of shown columns (terms) doesn't add up to 100 to be drawn. Absolute counts are also represented with bar plots, so that the vertical axis reports number of occurrences.
When either several pie charts are drawn for each word, or a single word has been entered,
the string
termFrequencies
, setCorpusVariables
, meta
,
DocumentTermMatrix
, barchart
, pie