Rdocumentation
powered by
Learn R Programming
⚠️
There's a newer version (0.7-16) of this package.
Take me there.
tm (version 0.5-4.1)
Text Mining Package
Description
A framework for text mining applications within R.
Copy Link
Link to current version
Version
Version
0.7-16
0.7-15
0.7-14
0.7-12
0.7-11
0.7-10
0.7-9
0.7-8
0.7-7
0.7-6
0.7-5
0.7-4
0.7-3
0.7-2
0.7-1
0.6-2
0.6-1
0.5-10
0.5-9.1
0.5-8.3
0.5-8.1
0.5-7.1
0.5-6
0.5-5
0.5-4.1
0.5-3
0.5-2
0.5-1
0.4
0.3-4.1
0.3-3
0.3-2
0.3-1
0.2-3.7
0.2-1
0.1-1
Install
install.packages('tm')
Monthly Downloads
37,471
Version
0.5-4.1
License
GPL (>= 2)
Maintainer
Ingo Feinerer
Last Published
September 27th, 2010
Functions in tm (0.5-4.1)
Search all functions
FunctionGenerator
Function Generator
VCorpus
Volatile Corpus
XMLSource
XML Source
getTransformations
List Available Transformations
crude
20 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic crude
Reuters21578Document
Reuters-21578 Text Document
stemCompletion
Complete Stems
TextDocument
Access and Modify Text Documents
as.PlainTextDocument
Create Objects of Class PlainTextDocument
RCV1Document
RCV1 Text Document
inspect
Inspect Objects
dissimilarity
Dissimilarity
tm_cluster
Allow `tm' to Use a Cluster
GmaneSource
Gmane Source
materialize
Materialize Lazy Mappings
tm_map
Transformations on Corpora
DirSource
Directory Source
tm_combine
Combine Corpora, Documents, and Term-Document Matrices
ReutersSource
Reuters-21578 XML Source
preprocessReut21578XML
Preprocess the Reuters-21578 XML archive.
number
The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix
tm_tag_score
Compute a Tag Score
removeNumbers
Remove Numbers from a Text Document
findAssocs
Find Associations in a Term-Document Matrix
acq
50 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic acq
getFilters
List Available Filters
stemDocument
Stem Words
searchFullText
Full Text Search
makeChunks
Split a Corpus into Chunks
stripWhitespace
Strip Whitespace from a Text Document
getReaders
List Available Readers
prescindMeta
Prescind Document Meta Data
readDOC
Read In a MS Word Document
getSources
List Available Sources
readXML
Read In an XML Document
DataframeSource
Data Frame Source
readGmane
Read In a Gmane RSS Feed
Dictionary
Dictionary
tm_intersect
Intersection between Documents and Words
stopwords
Multilingual Stopwords
URISource
Uniform Resource Identifier Source
readPDF
Read In a PDF Document
TermDocumentMatrix
Term-Document Matrix
Source
Access Sources
PCorpus
Permanent Corpus Constructor
plot
Visualize a Term-Document Matrix
readTabular
Read In a Text Document
sFilter
Statement Filter
findFreqTerms
Find Frequent Terms
removeSparseTerms
Remove Sparse Terms from a Term-Document Matrix
TextRepository
Text Repository
weightBin
Weight Binary
readPlain
Read In a Text Document
names
Row, Column, Dim Names, Document IDs, and Terms
writeCorpus
Write a Corpus to Disk
weightTf
Weight by Term Frequency
WeightFunction
Weighting Function
VectorSource
Vector Source
weightTfIdf
Weight by Term Frequency - Inverse Document Frequency
tm_filter
Filter and Index Functions on Corpora
Zipf_n_Heaps
Explore Corpus Term Frequency Characteristics
removePunctuation
Remove Punctuation Marks from a Text Document
readReut21578XML
Read In a Reuters-21578 XML Document
meta
Meta Data Management
PlainTextDocument
Plain Text Document
removeWords
Remove Words from a Text Document
readRCV1
Read In a Reuters Corpus Volume 1 Document
tm_reduce
Combine Transformations
termFreq
Term Frequency Vector