Learn R Programming

RcmdrPlugin.temis (version 0.6.1)

freqTermsDlg: List most frequent terms of a corpus

Description

List terms with the highest number of occurrences in the document-term matrix of a corpus.

Arguments

Details

This dialog allows printing the most frequent terms of the corpus. If a variable is chosen, the returned terms correspond to those with the highest total among the documents within each level of the variable. If None (whole corpus) is selected, the absolute frequency of the chosen terms and their percents in occurrences of all terms in the whole corpus are returned. If Document or a variable is chosen, details about the association of the term with documents or levels are shown: [object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

The probability is that of observing such extreme frequencies of the considered term in the level, under an hypergeometric distribution based on its global frequency in the corpus and on the number of occurrences of all terms in the document or variable level considered. The positive or negative character of the association is visible from the sign of the t value, or by comparing the value of the % Term/Level column with that of the Global % column.

See Also

frequentTerms, setCorpusVariables, meta, restrictTermsDlg, termsDictionary