Learn R Programming

⚠️There's a newer version (1.4.3) of this package.Take me there.

RTextTools (version 1.3.8)

Automatic Text Classification via Supervised Learning

Description

RTextTools is a machine learning package for automatic text classification that makes it simple for novice users to get started with machine learning, while allowing experienced users to easily experiment with different settings and algorithm combinations. The package includes nine algorithms for ensemble classification (svm, slda, boosting, bagging, random forests, glmnet, decision trees, neural networks, maximum entropy), comprehensive analytics, and thorough documentation.

Copy Link

Version

Install

install.packages('RTextTools')

Monthly Downloads

509

Version

1.3.8

License

GPL-3

Maintainer

Timothy P Jurka

Last Published

May 13th, 2012

Functions in RTextTools (1.3.8)

cross_validate

used for cross-validation of various algorithms.
train_model

makes a model object using the specified algorithm.
recall_accuracy

calculates the recall accuracy of the classified data.
train_models

makes a model object using the specified algorithms.
USCongress

a sample dataset containing labeled bills from the United State Congress.
read_data

reads data from files into an R data frame.
print_algorithms

prints available algorithms for train_model() and train_models().
NYTimes

a sample dataset containing labeled headlines from The New York Times.
analytics_container-class

an S4 class containing the analytics for a classified set of documents.
create_scoreSummary

creates a summary with the best label for each document.
create_matrix

creates a document-term matrix to be passed into create_corpus().
create_precisionRecallSummary

creates a summary with precision, recall, and F1 scores.
wordStem

Get the common root/stem of words
create_analytics

creates an object of class analytics given classification results.
matrix_container-class

an S4 class containing the training and classification matrices.
getStemLanguages

Query the languages supported in this package
wizard_train_classify

a simplified function for training and classifying data.
analytics_container_virgin-class

an S4 class containing the analytics for a classified set of documents.
classify_models

makes predictions from a train_models() object.
create_corpus

creates a corpus for training, classifying, and analyzing documents.
classify_model

makes predictions from a train_model() object.
create_ensembleSummary

creates a summary with ensemble coverage and precision.
wizard_read_data

a simplified function for reading data from files.