Learn R Programming

topicmodels (version 0.0-6)

AssociatedPress: Associated Press data

Description

Associated Press data from the First Text Retrieval Conference (TREC-1) 1992

Usage

data("AssociatedPress")

Arguments

format

The data set is an object of class "DocumentTermMatrix" provided by package tm. It is a document-term matrix which contains the term frequency of 10473 terms in 2246 documents.

source

Accompanying material to the source code for fitting LDA models provided by David M. Blei. Downloaded from: http://www.cs.berkeley.edu/~blei

References

D. Harman (1992) Overview of the first text retrieval conference (TREC-1). In Proceedings of the First Text Retrieval Conference (TREC-1), 1--20.