Learn R Programming

⚠️There's a newer version (0.7-16) of this package.Take me there.

tm (version 0.5-7.1)

Text Mining Package

Description

A framework for text mining applications within R.

Copy Link

Version

Install

install.packages('tm')

Monthly Downloads

32,051

Version

0.5-7.1

License

GPL (>= 2)

Maintainer

Ingo Feinerer

Last Published

February 3rd, 2012

Functions in tm (0.5-7.1)

RCV1Document

RCV1 Text Document
foreign

Read Document-Term Matrices
getTransformations

List Available Transformations
meta

Meta Data Management
tm_reduce

Combine Transformations
weightTfIdf

Weight by Term Frequency - Inverse Document Frequency
Reuters21578Document

Reuters-21578 Text Document
as.PlainTextDocument

Create Objects of Class PlainTextDocument
inspect

Inspect Objects
WeightFunction

Weighting Function
materialize

Materialize Lazy Mappings
makeChunks

Split a Corpus into Chunks
number

The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix
DataframeSource

Data Frame Source
VCorpus

Volatile Corpus
FunctionGenerator

Function Generator
GmaneSource

Gmane Source
plot

Visualize a Term-Document Matrix
ReutersSource

Reuters-21578 XML Source
PCorpus

Permanent Corpus Constructor
readPlain

Read In a Text Document
TextDocument

Access and Modify Text Documents
readRCV1

Read In a Reuters Corpus Volume 1 Document
removePunctuation

Remove Punctuation Marks from a Text Document
VectorSource

Vector Source
readDOC

Read In a MS Word Document
acq

50 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic acq
stopwords

Stopwords
crude

20 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic crude
tm_cluster

Allow `tm' to Use a Cluster
TermDocumentMatrix

Term-Document Matrix
stemDocument

Stem Words
searchFullText

Full Text Search
TextRepository

Text Repository
weightBin

Weight Binary
sFilter

Statement Filter
preprocessReut21578XML

Preprocess the Reuters-21578 XML archive.
readGmane

Read In a Gmane RSS Feed
readReut21578XML

Read In a Reuters-21578 XML Document
tm_filter

Filter and Index Functions on Corpora
tm_map

Transformations on Corpora
getReaders

List Available Readers
tm_combine

Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors
tm_intersect

Intersection between Documents and Words
getFilters

List Available Filters
getTokenizers

List Available Tokenizers
findAssocs

Find Associations in a Term-Document Matrix
names

Row, Column, Dim Names, Document IDs, and Terms
readXML

Read In an XML Document
removeSparseTerms

Remove Sparse Terms from a Term-Document Matrix
stripWhitespace

Strip Whitespace from a Text Document
tm_tag_score

Compute a Tag Score
weightSMART

SMART Weightings
writeCorpus

Write a Corpus to Disk
weightTf

Weight by Term Frequency
getSources

List Available Sources
stemCompletion

Complete Stems
Dictionary

Dictionary
PlainTextDocument

Plain Text Document
Source

Access Sources
URISource

Uniform Resource Identifier Source
XMLSource

XML Source
Zipf_n_Heaps

Explore Corpus Term Frequency Characteristics
DirSource

Directory Source
dissimilarity

Dissimilarity
findFreqTerms

Find Frequent Terms
prescindMeta

Prescind Document Meta Data
readTabular

Read In a Text Document
readPDF

Read In a PDF Document
removeWords

Remove Words from a Text Document
removeNumbers

Remove Numbers from a Text Document
tokenizer

Tokenizers
termFreq

Term Frequency Vector