Learn R Programming

ANLP (version 1.3)

generateTDM: Generate term document frequency table from corpus

Description

This function builds term documement sparse matrix

Usage

generateTDM(data, N, isTrace = F)

Arguments

data
It can be text corpus/data cleaned by cleanTextData
N
size of n-gram model
isTrace
for debugging purpose, use this if you want to track time to build model.

Value

term document matrix for terms having N words

Details

This function generates terms with N number of words specified in argument. This can be used in many tasks like information retrival, document similarity etc.

See Also

TermDocumentMatrix buildNgramModel