tm v0.7-8

Text Mining Package

A framework for text mining applications within R.

Functions in tm

Name                 Description
removePunctuation    Remove Punctuation Marks from a Text Document
DirSource            Directory Source
Docs                 Access Document IDs and Terms
ZipSource            ZIP File Source
Zipf_n_Heaps         Explore Corpus Term Frequency Characteristics
hpc                  Parallelized ‘lapply’
inspect              Inspect Objects
PCorpus              Permanent Corpora
SimpleCorpus         Simple Corpora
content_transformer  Content Transformers
stemDocument         Stem Words
acq                  50 Exemplary News Articles from the Reuters-21578 Data Set of Topic acq
PlainTextDocument    Plain Text Documents
VectorSource         Vector Source
stopwords            Stopwords
WeightFunction       Weighting Function
weightSMART          SMART Weightings
TermDocumentMatrix   Term-Document Matrix
meta                 Metadata Management
weightTf             Weight by Term Frequency
readRCV1             Read In a Reuters Corpus Volume 1 Document
stripWhitespace      Strip Whitespace from a Text Document
Corpus               Corpora
readXML              Read In an XML Document
removeNumbers        Remove Numbers from a Text Document
DataframeSource      Data Frame Source
readTagged           Read In a POS-Tagged Word Text Document
crude                20 Exemplary News Articles from the Reuters-21578 Data Set of Topic crude
readReut21578XML     Read In a Reuters-21578 XML Document
tm_combine           Combine Corpora, Documents, Term-Document Matrices, and Term Frequency Vectors
Reader               Readers
TextDocument         Text Documents
Source               Sources
readPlain            Read In a Text Document
findMostFreqTerms    Find Most Frequent Terms
XMLTextDocument      XML Text Documents
XMLSource            XML Source
getTokenizers        Tokenizers
removeSparseTerms    Remove Sparse Terms from a Term-Document Matrix
weightTfIdf          Weight by Term Frequency - Inverse Document Frequency
writeCorpus          Write a Corpus to Disk
getTransformations   Transformations
termFreq             Term Frequency Vector
tm_map               Transformations on Corpora
plot                 Visualize a Term-Document Matrix
tm_filter            Filter and Index Functions on Corpora
readDOC              Read In a MS Word Document
foreign              Read Document-Term Matrices
findAssocs           Find Associations in a Term-Document Matrix
tm_reduce            Combine Transformations
tm_term_score        Compute Score for Matching Terms
findFreqTerms        Find Frequent Terms
readPDF              Read In a PDF Document
URISource            Uniform Resource Identifier Source
removeWords          Remove Words from a Text Document
VCorpus              Volatile Corpora
readDataframe        Read In a Text Document from a Data Frame
tokenizer            Tokenizers
stemCompletion       Complete Stems
weightBin            Weight Binary

Vignettes of tm

Name
extensions.Rnw
references.bib
tm.Rnw

Details

Date: 2020-11-17
LinkingTo: BH, Rcpp
SystemRequirements: C++11
License: GPL-3
URL: http://tm.r-forge.r-project.org/
Additional_repositories: https://datacube.wu.ac.at
NeedsCompilation: yes
Packaged: 2020-11-18 08:39:38 UTC; hornik
Repository: CRAN
Date/Publication: 2020-11-18 11:13:22 UTC
