There's a newer version (0.7-16) of this package.
tm (version 0.5-7.1)
Text Mining Package
Description
A framework for text mining applications within R.
Install
install.packages('tm')
Monthly Downloads
32,051
Version
0.5-7.1
License
GPL (>= 2)
Maintainer
Ingo Feinerer
Last Published
February 3rd, 2012
Functions in tm (0.5-7.1)
RCV1Document
RCV1 Text Document
foreign
Read Document-Term Matrices
getTransformations
List Available Transformations
meta
Meta Data Management
tm_reduce
Combine Transformations
weightTfIdf
Weight by Term Frequency - Inverse Document Frequency
Reuters21578Document
Reuters-21578 Text Document
as.PlainTextDocument
Create Objects of Class PlainTextDocument
inspect
Inspect Objects
WeightFunction
Weighting Function
materialize
Materialize Lazy Mappings
makeChunks
Split a Corpus into Chunks
number
The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix
DataframeSource
Data Frame Source
VCorpus
Volatile Corpus
FunctionGenerator
Function Generator
GmaneSource
Gmane Source
plot
Visualize a Term-Document Matrix
ReutersSource
Reuters-21578 XML Source
PCorpus
Permanent Corpus Constructor
readPlain
Read In a Text Document
TextDocument
Access and Modify Text Documents
readRCV1
Read In a Reuters Corpus Volume 1 Document
removePunctuation
Remove Punctuation Marks from a Text Document
VectorSource
Vector Source
readDOC
Read In a MS Word Document
acq
50 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic acq
stopwords
Stopwords
crude
20 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic crude
tm_cluster
Allow 'tm' to Use a Cluster
TermDocumentMatrix
Term-Document Matrix
stemDocument
Stem Words
searchFullText
Full Text Search
TextRepository
Text Repository
weightBin
Weight Binary
sFilter
Statement Filter
preprocessReut21578XML
Preprocess the Reuters-21578 XML Archive
readGmane
Read In a Gmane RSS Feed
readReut21578XML
Read In a Reuters-21578 XML Document
tm_filter
Filter and Index Functions on Corpora
tm_map
Transformations on Corpora
getReaders
List Available Readers
tm_combine
Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors
tm_intersect
Intersection between Documents and Words
getFilters
List Available Filters
getTokenizers
List Available Tokenizers
findAssocs
Find Associations in a Term-Document Matrix
names
Row, Column, Dim Names, Document IDs, and Terms
readXML
Read In an XML Document
removeSparseTerms
Remove Sparse Terms from a Term-Document Matrix
stripWhitespace
Strip Whitespace from a Text Document
tm_tag_score
Compute a Tag Score
weightSMART
SMART Weightings
writeCorpus
Write a Corpus to Disk
weightTf
Weight by Term Frequency
getSources
List Available Sources
stemCompletion
Complete Stems
Dictionary
Dictionary
PlainTextDocument
Plain Text Document
Source
Access Sources
URISource
Uniform Resource Identifier Source
XMLSource
XML Source
Zipf_n_Heaps
Explore Corpus Term Frequency Characteristics
DirSource
Directory Source
dissimilarity
Dissimilarity
findFreqTerms
Find Frequent Terms
prescindMeta
Prescind Document Meta Data
readTabular
Read In a Text Document
readPDF
Read In a PDF Document
removeWords
Remove Words from a Text Document
removeNumbers
Remove Numbers from a Text Document
tokenizer
Tokenizers
termFreq
Term Frequency Vector