Learn R Programming

⚠️There's a newer version (0.7-18) of this package.Take me there.

tm (version 0.5-7.1)

Text Mining Package

Description

A framework for text mining applications within R.

Copy Link

Version

Install

install.packages('tm')

Monthly Downloads

39,756

Version

0.5-7.1

License

GPL (>= 2)

Maintainer

Ingo Feinerer

Last Published

February 3rd, 2012

Functions in tm (0.5-7.1)

RCV1 Text Document

Read Document-Term Matrices

getTransformations

List Available Transformations

Meta Data Management

Combine Transformations

Weight by Term Frequency - Inverse Document Frequency

Reuters21578Document

Reuters-21578 Text Document

as.PlainTextDocument

Create Objects of Class PlainTextDocument

Inspect Objects

Weighting Function

Materialize Lazy Mappings

Split a Corpus into Chunks

The Number of Rows/Columns/Dimensions/Documents/Terms of a Term-Document Matrix

DataframeSource

Data Frame Source

Volatile Corpus

FunctionGenerator

Function Generator

Visualize a Term-Document Matrix

Reuters-21578 XML Source

Permanent Corpus Constructor

Read In a Text Document

Access and Modify Text Documents

Read In a Reuters Corpus Volume 1 Document

removePunctuation

Remove Punctuation Marks from a Text Document

Read In a MS Word Document

50 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic acq

20 Exemplary News Articles from the Reuters-21578 XML Data Set of Topic crude

Allow `tm' to Use a Cluster

TermDocumentMatrix

Term-Document Matrix

Full Text Search

Text Repository

Statement Filter

preprocessReut21578XML

Preprocess the Reuters-21578 XML archive.

Read In a Gmane RSS Feed

readReut21578XML

Read In a Reuters-21578 XML Document

Filter and Index Functions on Corpora

Transformations on Corpora

List Available Readers

Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors

Intersection between Documents and Words

List Available Filters

List Available Tokenizers

Find Associations in a Term-Document Matrix

Row, Column, Dim Names, Document IDs, and Terms

Read In an XML Document

removeSparseTerms

Remove Sparse Terms from a Term-Document Matrix

stripWhitespace

Strip Whitespace from a Text Document

Compute a Tag Score

SMART Weightings

Write a Corpus to Disk

Weight by Term Frequency

List Available Sources

PlainTextDocument

Plain Text Document

Uniform Resource Identifier Source

Explore Corpus Term Frequency Characteristics

Directory Source

Find Frequent Terms

Prescind Document Meta Data

Read In a Text Document

Read In a PDF Document

Remove Words from a Text Document

Remove Numbers from a Text Document

Term Frequency Vector