A newer version (0.7-16) of this package is available.

tm (version 0.7-8)

Text Mining Package

Description

A framework for text mining applications within R.

Install

install.packages('tm')

Monthly Downloads

60,856

Version

0.7-8

License

GPL-3

Maintainer

Ingo Feinerer

Last Published

November 18th, 2020

Functions in tm (0.7-8)

removePunctuation: Remove Punctuation Marks from a Text Document
DirSource: Directory Source
Docs: Access Document IDs and Terms
ZipSource: ZIP File Source
Zipf_n_Heaps: Explore Corpus Term Frequency Characteristics
hpc: Parallelized ‘lapply’
inspect: Inspect Objects
PCorpus: Permanent Corpora
SimpleCorpus: Simple Corpora
content_transformer: Content Transformers
stemDocument: Stem Words
acq: 50 Exemplary News Articles from the Reuters-21578 Data Set of Topic acq
PlainTextDocument: Plain Text Documents
VectorSource: Vector Source
stopwords: Stopwords
WeightFunction: Weighting Function
weightSMART: SMART Weightings
TermDocumentMatrix: Term-Document Matrix
meta: Metadata Management
weightTf: Weight by Term Frequency
readRCV1: Read In a Reuters Corpus Volume 1 Document
stripWhitespace: Strip Whitespace from a Text Document
Corpus: Corpora
readXML: Read In an XML Document
removeNumbers: Remove Numbers from a Text Document
DataframeSource: Data Frame Source
readTagged: Read In a POS-Tagged Word Text Document
crude: 20 Exemplary News Articles from the Reuters-21578 Data Set of Topic crude
readReut21578XML: Read In a Reuters-21578 XML Document
tm_combine: Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors
Reader: Readers
TextDocument: Text Documents
Source: Sources
readPlain: Read In a Text Document
findMostFreqTerms: Find Most Frequent Terms
XMLTextDocument: XML Text Documents
XMLSource: XML Source
getTokenizers: Tokenizers
removeSparseTerms: Remove Sparse Terms from a Term-Document Matrix
weightTfIdf: Weight by Term Frequency - Inverse Document Frequency
writeCorpus: Write a Corpus to Disk
getTransformations: Transformations
termFreq: Term Frequency Vector
tm_map: Transformations on Corpora
plot: Visualize a Term-Document Matrix
tm_filter: Filter and Index Functions on Corpora
readDOC: Read In a MS Word Document
foreign: Read Document-Term Matrices
findAssocs: Find Associations in a Term-Document Matrix
tm_reduce: Combine Transformations
tm_term_score: Compute Score for Matching Terms
findFreqTerms: Find Frequent Terms
readPDF: Read In a PDF Document
URISource: Uniform Resource Identifier Source
removeWords: Remove Words from a Text Document
VCorpus: Volatile Corpora
readDataframe: Read In a Text Document from a Data Frame
tokenizer: Tokenizers
stemCompletion: Complete Stems
weightBin: Weight Binary
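
Example

The index above only names the functions, so a short sketch of how several of them might fit together may help. It builds a corpus with VectorSource and VCorpus, cleans it with tm_map, and summarizes it with TermDocumentMatrix and findFreqTerms. The three input strings are hypothetical and invented purely for illustration, and the order of the cleaning steps is one reasonable choice, not a prescribed workflow.

```r
library(tm)

# Hypothetical toy documents (assumption: made up for this sketch)
texts <- c(
  "Text mining in R: 3 simple steps!",
  "Mining text with the tm package.",
  "The steps are simple."
)

# Build a volatile (in-memory) corpus from a character vector
corpus <- VCorpus(VectorSource(texts))

# Clean the documents; a plain function such as tolower is wrapped
# in content_transformer() so that tm_map() keeps the document structure
corpus <- tm_map(corpus, content_transformer(tolower))
corpus <- tm_map(corpus, removePunctuation)
corpus <- tm_map(corpus, removeNumbers)
corpus <- tm_map(corpus, removeWords, stopwords("english"))
corpus <- tm_map(corpus, stripWhitespace)

# Term-document matrix: terms as rows, documents as columns
tdm <- TermDocumentMatrix(corpus)

# Print a summary, then list terms occurring at least twice overall
inspect(tdm)
frequent <- findFreqTerms(tdm, lowfreq = 2)
print(frequent)
```

With the default settings, TermDocumentMatrix ignores terms shorter than three characters, so very short tokens do not appear in the result.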