tm v0.7-5


Text Mining Package

A framework for text mining applications within R.

Functions in tm

Name Description
acq 50 Exemplary News Articles from the Reuters-21578 Data Set of Topic acq
TextDocument Text Documents
findMostFreqTerms Find Most Frequent Terms
readDataframe Read In a Text Document from a Data Frame
foreign Read Document-Term Matrices
stemDocument Stem Words
readPDF Read In a PDF Document
stopwords Stopwords
weightSMART SMART Weightings
XMLSource XML Source
DirSource Directory Source
weightTf Weight by Term Frequency
XMLTextDocument XML Text Documents
findAssocs Find Associations in a Term-Document Matrix
findFreqTerms Find Frequent Terms
stripWhitespace Strip Whitespace from a Text Document
Docs Access Document IDs and Terms
termFreq Term Frequency Vector
URISource Uniform Resource Identifier Source
VCorpus Volatile Corpora
getTokenizers Tokenizers
tokenizer Tokenizers
weightBin Weight Binary
getTransformations Transformations
TermDocumentMatrix Term-Document Matrix
Corpus Corpora
DataframeSource Data Frame Source
meta Metadata Management
Reader Readers
SimpleCorpus Simple Corpora
PCorpus Permanent Corpora
readXML Read In an XML Document
VectorSource Vector Source
removeNumbers Remove Numbers from a Text Document
PlainTextDocument Plain Text Documents
ZipSource ZIP File Source
Zipf_n_Heaps Explore Corpus Term Frequency Characteristics
content_transformer Content Transformers
hpc Parallelized ‘lapply’
inspect Inspect Objects
WeightFunction Weighting Function
readReut21578XML Read In a Reuters-21578 XML Document
crude 20 Exemplary News Articles from the Reuters-21578 Data Set of Topic crude
readPlain Read In a Text Document
plot Visualize a Term-Document Matrix
readTagged Read In a POS-Tagged Word Text Document
readRCV1 Read In a Reuters Corpus Volume 1 Document
readDOC Read In a MS Word Document
removeWords Remove Words from a Text Document
stemCompletion Complete Stems
weightTfIdf Weight by Term Frequency - Inverse Document Frequency
removePunctuation Remove Punctuation Marks from a Text Document
removeSparseTerms Remove Sparse Terms from a Term-Document Matrix
writeCorpus Write a Corpus to Disk
tm_filter Filter and Index Functions on Corpora
tm_reduce Combine Transformations
tm_term_score Compute Score for Matching Terms
tm_map Transformations on Corpora
tm_combine Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors
Source Sources
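
The functions listed above are typically chained: a source feeds a corpus, transformations clean it, and a term-document matrix summarizes it. The following is a minimal sketch of one such chain, not an official recipe. The three sample sentences and the variable names (docs, corpus, tdm, frequent) are made up for illustration, and the choice and order of transformations is an assumption.

```r
library(tm)

# Illustrative input: three short, made-up documents (not package data).
docs <- c("Crude oil prices rose sharply in 1987.",
          "Oil producers agreed to cut crude output.",
          "The company acquired 3 smaller firms.")

# Wrap the character vector in a source and build a volatile (in-memory) corpus.
corpus <- VCorpus(VectorSource(docs))

# Clean the text. tolower is a base R function, so it is wrapped with
# content_transformer() to make it operate on document content.
corpus <- tm_map(corpus, content_transformer(tolower))
corpus <- tm_map(corpus, removeNumbers)
corpus <- tm_map(corpus, removePunctuation)
corpus <- tm_map(corpus, removeWords, stopwords("english"))
corpus <- tm_map(corpus, stripWhitespace)

# Build a term-document matrix (terms as rows, documents as columns).
tdm <- TermDocumentMatrix(corpus)

# Terms that occur at least twice across the whole corpus.
frequent <- findFreqTerms(tdm, lowfreq = 2)
print(frequent)

# Show a summary and a sample of the matrix.
inspect(tdm)
```

Stemming with stemDocument is left out of this sketch because it depends on the separate SnowballC package being installed.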


Date: 2018-07-29
LinkingTo: BH, Rcpp
SystemRequirements: C++11
License: GPL-3
NeedsCompilation: yes
Packaged: 2018-07-29 16:33:17 UTC; hornik
Repository: CRAN
Date/Publication: 2018-07-29 16:56:27 UTC
