tm v0.6-1


by Ingo Feinerer

Text Mining Package

A framework for text mining applications within R.

Functions in tm

Name                 Description
PlainTextDocument    Plain Text Documents
TextDocument         Text Documents
findFreqTerms        Find Frequent Terms
crude                20 Exemplary News Articles from the Reuters-21578 Data Set of Topic crude
DataframeSource      Data Frame Source
Docs                 Access Document IDs and Terms
readTagged           Read In a POS-Tagged Word Text Document
removePunctuation    Remove Punctuation Marks from a Text Document
ZipSource            ZIP File Source
writeCorpus          Write a Corpus to Disk
VCorpus              Volatile Corpora
findAssocs           Find Associations in a Term-Document Matrix
Corpus               Corpora
readXML              Read In an XML Document
getTokenizers        Tokenizers
readRCV1             Read In a Reuters Corpus Volume 1 Document
PCorpus              Permanent Corpora
XMLTextDocument      XML Text Documents
VectorSource         Vector Source
acq                  50 Exemplary News Articles from the Reuters-21578 Data Set of Topic acq
WeightFunction       Weighting Function
stopwords            Stopwords
stripWhitespace      Strip Whitespace from a Text Document
DirSource            Directory Source
readDOC              Read In a MS Word Document
tm_reduce            Combine Transformations
tm_filter            Filter and Index Functions on Corpora
readTabular          Read In a Text Document
meta                 Metadata Management
tokenizer            Tokenizers
XMLSource            XML Source
tm_term_score        Compute Score for Matching Terms
weightSMART          SMART Weightings
foreign              Read Document-Term Matrices
content_transformer  Content Transformers
URISource            Uniform Resource Identifier Source
tm_map               Transformations on Corpora
readPlain            Read In a Text Document
weightTf             Weight by Term Frequency
Source               Sources
removeNumbers        Remove Numbers from a Text Document
weightBin            Weight Binary
readReut21578XML     Read In a Reuters-21578 XML Document
weightTfIdf          Weight by Term Frequency - Inverse Document Frequency
stemCompletion       Complete Stems
stemDocument         Stem Words
readPDF              Read In a PDF Document
termFreq             Term Frequency Vector
inspect              Inspect Objects
getTransformations   Transformations
removeSparseTerms    Remove Sparse Terms from a Term-Document Matrix
TermDocumentMatrix   Term-Document Matrix
plot                 Visualize a Term-Document Matrix
tm_combine           Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors
Zipf_n_Heaps         Explore Corpus Term Frequency Characteristics
removeWords          Remove Words from a Text Document
Reader               Readers

Date 2015-05-06
SystemRequirements Antiword for reading MS Word files, pdfinfo and pdftotext from Poppler for reading PDF
License GPL-3
NeedsCompilation yes
Packaged 2015-05-07 04:05:51 UTC; hornik
Repository CRAN
Date/Publication 2015-05-07 07:02:02
