Find frequent terms in a document-term or term-document matrix.
findFreqTerms(x, lowfreq = 0, highfreq = Inf)
A numeric for the lower frequency bound.
A numeric for the upper frequency bound.
A character vector of terms in x
which occur more or equal often
than lowfreq
times and less or equal often than highfreq
times.
This method works for all numeric weightings but is probably
most meaningful for the standard term frequency (tf
) weighting
of x
.
# NOT RUN {
data("crude")
tdm <- TermDocumentMatrix(crude)
findFreqTerms(tdm, 2, 3)
# }
Run the code above in your browser using DataLab