tm v0.7-8



Text Mining Package

A framework for text mining applications within R.

Functions in tm

Name                 Description
removePunctuation    Remove Punctuation Marks from a Text Document
DirSource            Directory Source
Docs                 Access Document IDs and Terms
ZipSource            ZIP File Source
Zipf_n_Heaps         Explore Corpus Term Frequency Characteristics
hpc                  Parallelized ‘lapply’
inspect              Inspect Objects
PCorpus              Permanent Corpora
SimpleCorpus         Simple Corpora
content_transformer  Content Transformers
stemDocument         Stem Words
acq                  50 Exemplary News Articles from the Reuters-21578 Data Set of Topic acq
PlainTextDocument    Plain Text Documents
VectorSource         Vector Source
stopwords            Stopwords
WeightFunction       Weighting Function
weightSMART          SMART Weightings
TermDocumentMatrix   Term-Document Matrix
meta                 Metadata Management
weightTf             Weight by Term Frequency
readRCV1             Read In a Reuters Corpus Volume 1 Document
stripWhitespace      Strip Whitespace from a Text Document
Corpus               Corpora
readXML              Read In an XML Document
removeNumbers        Remove Numbers from a Text Document
DataframeSource      Data Frame Source
readTagged           Read In a POS-Tagged Word Text Document
crude                20 Exemplary News Articles from the Reuters-21578 Data Set of Topic crude
readReut21578XML     Read In a Reuters-21578 XML Document
tm_combine           Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors
Reader               Readers
TextDocument         Text Documents
Source               Sources
readPlain            Read In a Text Document
findMostFreqTerms    Find Most Frequent Terms
XMLTextDocument      XML Text Documents
XMLSource            XML Source
getTokenizers        Tokenizers
removeSparseTerms    Remove Sparse Terms from a Term-Document Matrix
weightTfIdf          Weight by Term Frequency - Inverse Document Frequency
writeCorpus          Write a Corpus to Disk
getTransformations   Transformations
termFreq             Term Frequency Vector
tm_map               Transformations on Corpora
plot                 Visualize a Term-Document Matrix
tm_filter            Filter and Index Functions on Corpora
readDOC              Read In a MS Word Document
foreign              Read Document-Term Matrices
findAssocs           Find Associations in a Term-Document Matrix
tm_reduce            Combine Transformations
tm_term_score        Compute Score for Matching Terms
findFreqTerms        Find Frequent Terms
readPDF              Read In a PDF Document
URISource            Uniform Resource Identifier Source
removeWords          Remove Words from a Text Document
VCorpus              Volatile Corpora
readDataframe        Read In a Text Document from a Data Frame
tokenizer            Tokenizers
stemCompletion       Complete Stems
weightBin            Weight Binary


Date: 2020-11-17
LinkingTo: BH, Rcpp
SystemRequirements: C++11
License: GPL-3
NeedsCompilation: yes
Packaged: 2020-11-18 08:39:38 UTC; hornik
Repository: CRAN
Date/Publication: 2020-11-18 11:13:22 UTC
