Rdocumentation
powered by
Learn R Programming
⚠️
There's a newer version (0.7-14) of this package.
Take me there.
tm (version 0.7-12)
Text Mining Package
Description
A framework for text mining applications within R.
Copy Link
Link to current version
Version
Version
0.7-14
0.7-12
0.7-11
0.7-10
0.7-9
0.7-8
0.7-7
0.7-6
0.7-5
0.7-4
0.7-3
0.7-2
0.7-1
0.6-2
0.6-1
0.5-10
0.5-9.1
0.5-8.3
0.5-8.1
0.5-7.1
0.5-6
0.5-5
0.5-4.1
0.5-3
0.5-2
0.5-1
0.4
0.3-4.1
0.3-3
0.3-2
0.3-1
0.2-3.7
0.2-1
0.1-1
Install
install.packages('tm')
Monthly Downloads
61,449
Version
0.7-12
License
GPL-3
Maintainer
Ingo Feinerer
Last Published
March 11th, 2024
Functions in tm (0.7-12)
Search all functions
findAssocs
Find Associations in a Term-Document Matrix
XMLTextDocument
XML Text Documents
meta
Metadata Management
inspect
Inspect Objects
TextDocument
Text Documents
removeWords
Remove Words from a Text Document
XMLSource
XML Source
TermDocumentMatrix
Term-Document Matrix
hpc
Parallelized ‘lapply’
VectorSource
Vector Source
stemCompletion
Complete Stems
findFreqTerms
Find Frequent Terms
plot
Visualize a Term-Document Matrix
readDOC
Read In a MS Word Document
ZipSource
ZIP File Source
readDataframe
Read In a Text Document from a Data Frame
Zipf_n_Heaps
Explore Corpus Term Frequency Characteristics
WeightFunction
Weighting Function
tm_filter
Filter and Index Functions on Corpora
weightBin
Weight Binary
tokenizer
Tokenizers
readPDF
Read In a PDF Document
getTokenizers
Tokenizers
getTransformations
Transformations
VCorpus
Volatile Corpora
URISource
Uniform Resource Identifier Source
SimpleCorpus
Simple Corpora
findMostFreqTerms
Find Most Frequent Terms
tm_map
Transformations on Corpora
stemDocument
Stem Words
readRCV1
Read In a Reuters Corpus Volume 1 Document
tm_combine
Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors
content_transformer
Content Transformers
acq
50 Exemplary News Articles from the Reuters-21578 Data Set of Topic acq
readPlain
Read In a Text Document
crude
20 Exemplary News Articles from the Reuters-21578 Data Set of Topic crude
stripWhitespace
Strip Whitespace from a Text Document
termFreq
Term Frequency Vector
readTagged
Read In a POS-Tagged Word Text Document
readReut21578XML
Read In a Reuters-21578 XML Document
foreign
Read Document-Term Matrices
readXML
Read In an XML Document
weightSMART
SMART Weightings
tm_reduce
Combine Transformations
tm_term_score
Compute Score for Matching Terms
removePunctuation
Remove Punctuation Marks from a Text Document
stopwords
Stopwords
weightTf
Weight by Term Frequency
removeNumbers
Remove Numbers from a Text Document
removeSparseTerms
Remove Sparse Terms from a Term-Document Matrix
writeCorpus
Write a Corpus to Disk
weightTfIdf
Weight by Term Frequency - Inverse Document Frequency
DirSource
Directory Source
Docs
Access Document IDs and Terms
Reader
Readers
Corpus
Corpora
PCorpus
Permanent Corpora
DataframeSource
Data Frame Source
Source
Sources
PlainTextDocument
Plain Text Documents