RDocumentation
Moon
Learn R
Search all packages and functions
⚠️
There's a newer version (0.7-12) of this package.
Take me there.
tm (version 0.7-2)
Text Mining Package
Description
A framework for text mining applications within R.
Copy Link
Copy
Link to current version
Version
Version
0.7-12
0.7-11
0.7-10
0.7-9
0.7-8
0.7-7
0.7-6
0.7-5
0.7-4
0.7-3
0.7-2
0.7-1
0.6-2
0.6-1
0.5-10
0.5-9.1
0.5-8.3
0.5-8.1
0.5-7.1
0.5-6
0.5-5
0.5-4.1
0.5-3
0.5-2
0.5-1
0.4
0.3-4.1
0.3-3
0.3-2
0.3-1
0.2-3.7
0.2-1
0.1-1
Down Chevron
Install
install.packages('tm')
Monthly Downloads
51,138
Version
0.7-2
License
GPL-3
Maintainer
Ingo Feinerer
Last Published
November 18th, 2017
Functions in tm (0.7-2)
Search functions
Corpus
Corpora
TextDocument
Text Documents
Source
Sources
DataframeSource
Data Frame Source
PlainTextDocument
Plain Text Documents
DirSource
Directory Source
Docs
Access Document IDs and Terms
SimpleCorpus
Simple Corpora
Reader
Readers
XMLSource
XML Source
ZipSource
ZIP File Source
PCorpus
Permanent Corpora
VectorSource
Vector Source
VCorpus
Volatile Corpora
WeightFunction
Weighting Function
crude
20 Exemplary News Articles from the Reuters-21578 Data Set of Topic crude
URISource
Uniform Resource Identifier Source
acq
50 Exemplary News Articles from the Reuters-21578 Data Set of Topic acq
findAssocs
Find Associations in a Term-Document Matrix
findMostFreqTerms
Find Most Frequent Terms
tm_combine
Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors
foreign
Read Document-Term Matrices
content_transformer
Content Transformers
Zipf_n_Heaps
Explore Corpus Term Frequency Characteristics
readDataframe
Read In a Text Document from a Data Frame
getTokenizers
Tokenizers
readPDF
Read In a PDF Document
getTransformations
Transformations
readReut21578XML
Read In a Reuters-21578 XML Document
removePunctuation
Remove Punctuation Marks from a Text Document
readTagged
Read In a POS-Tagged Word Text Document
removeSparseTerms
Remove Sparse Terms from a Term-Document Matrix
stripWhitespace
Strip Whitespace from a Text Document
termFreq
Term Frequency Vector
findFreqTerms
Find Frequent Terms
weightTfIdf
Weight by Term Frequency - Inverse Document Frequency
plot
Visualize a Term-Document Matrix
writeCorpus
Write a Corpus to Disk
readDOC
Read In a MS Word Document
weightSMART
SMART Weightings
readXML
Read In an XML Document
weightTf
Weight by Term Frequency
stopwords
Stopwords
removeNumbers
Remove Numbers from a Text Document
tm_reduce
Combine Transformations
tokenizer
Tokenizers
tm_term_score
Compute Score for Matching Terms
XMLTextDocument
XML Text Documents
weightBin
Weight Binary
hpc
Parallelized ‘lapply’
readPlain
Read In a Text Document
inspect
Inspect Objects
readRCV1
Read In a Reuters Corpus Volume 1 Document
TermDocumentMatrix
Term-Document Matrix
meta
Metadata Management
stemDocument
Stem Words
removeWords
Remove Words from a Text Document
stemCompletion
Complete Stems
tm_filter
Filter and Index Functions on Corpora
tm_map
Transformations on Corpora