tm v0.7-7


Text Mining Package

A framework for text mining applications within R.

Functions in tm

Name                 Description
Source               Sources
PCorpus              Permanent Corpora
TextDocument         Text Documents
Zipf_n_Heaps         Explore Corpus Term Frequency Characteristics
VectorSource         Vector Source
readDOC              Read In a MS Word Document
WeightFunction       Weighting Function
inspect              Inspect Objects
plot                 Visualize a Term-Document Matrix
hpc                  Parallelized ‘lapply’
Reader               Readers
PlainTextDocument    Plain Text Documents
tm_filter            Filter and Index Functions on Corpora
SimpleCorpus         Simple Corpora
crude                20 Exemplary News Articles from the Reuters-21578 Data Set of Topic crude
Corpus               Corpora
weightSMART          SMART Weightings
Docs                 Access Document IDs and Terms
ZipSource            ZIP File Source
DirSource            Directory Source
weightTfIdf          Weight by Term Frequency - Inverse Document Frequency
stripWhitespace      Strip Whitespace from a Text Document
VCorpus              Volatile Corpora
DataframeSource      Data Frame Source
findAssocs           Find Associations in a Term-Document Matrix
readDataframe        Read In a Text Document from a Data Frame
findFreqTerms        Find Frequent Terms
getTokenizers        Tokenizers
URISource            Uniform Resource Identifier Source
getTransformations   Transformations
termFreq             Term Frequency Vector
weightTf             Weight by Term Frequency
readXML              Read In an XML Document
tokenizer            Tokenizers
meta                 Metadata Management
TermDocumentMatrix   Term-Document Matrix
stemDocument         Stem Words
tm_combine           Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors
acq                  50 Exemplary News Articles from the Reuters-21578 Data Set of Topic acq
removePunctuation    Remove Punctuation Marks from a Text Document
content_transformer  Content Transformers
readPlain            Read In a Text Document
readPDF              Read In a PDF Document
weightBin            Weight Binary
readReut21578XML     Read In a Reuters-21578 XML Document
removeNumbers        Remove Numbers from a Text Document
stopwords            Stopwords
writeCorpus          Write a Corpus to Disk
findMostFreqTerms    Find Most Frequent Terms
tm_map               Transformations on Corpora
readTagged           Read In a POS-Tagged Word Text Document
removeSparseTerms    Remove Sparse Terms from a Term-Document Matrix
XMLSource            XML Source
removeWords          Remove Words from a Text Document
XMLTextDocument      XML Text Documents
tm_term_score        Compute Score for Matching Terms
foreign              Read Document-Term Matrices
readRCV1             Read In a Reuters Corpus Volume 1 Document
stemCompletion       Complete Stems
tm_reduce            Combine Transformations
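
As a rough illustration of how several of the functions listed above fit together, the sketch below builds a small in-memory corpus, cleans it, and computes a term-document matrix. The three sample sentences and the variable names (texts, corpus, tdm) are made up for this example and are not part of the package; the tm package is assumed to be installed.

```r
library(tm)

# Three tiny example documents (invented for illustration only).
texts <- c("Crude oil prices rose sharply today.",
           "Oil output fell; prices for crude rose again.",
           "The company acquired a smaller rival.")

# Build a volatile (in-memory) corpus from a character vector.
corpus <- VCorpus(VectorSource(texts))

# Apply a chain of transformations with tm_map().
# Base R functions such as tolower() are wrapped in content_transformer()
# so that the result stays a text document.
corpus <- tm_map(corpus, content_transformer(tolower))
corpus <- tm_map(corpus, removePunctuation)
corpus <- tm_map(corpus, removeWords, stopwords("english"))
corpus <- tm_map(corpus, stripWhitespace)

# Build a term-document matrix (terms in rows, documents in columns).
tdm <- TermDocumentMatrix(corpus)

# List the terms that occur at least twice across the whole corpus.
frequent <- findFreqTerms(tdm, lowfreq = 2)
print(frequent)
```

VCorpus is used here because it keeps documents in memory and accepts arbitrary transformations; PCorpus and SimpleCorpus from the list above are alternatives with different storage and flexibility trade-offs.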

Date                 2019-12-12
LinkingTo            BH, Rcpp
SystemRequirements   C++11
License              GPL-3
NeedsCompilation     yes
Packaged             2019-12-12 09:37:23 UTC; hornik
Repository           CRAN
Date/Publication     2019-12-12 10:06:26 UTC
