topicmodels (version 0.2-4)

AssociatedPress: Associated Press data

Description

Associated Press data from the First Text Retrieval Conference (TREC-1) 1992.

Usage

data("AssociatedPress")

Arguments

Format

The data set is an object of class "DocumentTermMatrix" provided by package tm. It is a document-term matrix which contains the term frequency of 10473 terms in 2246 documents.

Source

Accompanying material to the source code for fitting LDA models provided by David M. Blei and co-authors. Downloaded from: http://www.cs.columbia.edu/~blei/

References

D. Harman (1992) Overview of the first text retrieval conference (TREC-1). In Proceedings of the First Text Retrieval Conference (TREC-1), 1--20.