data_bookReviews

This is a subset of the data used in the paper, which
was assembled by Prettenhofer and Stein (2010). It contains 1000 reviews
of books on Amazon, of which 500 were selected from the original training data
and 500 from the test data.
The full dataset has been used for a variety of things, including
classification using svm. The subset was chosen small enough to keep the computation
time low, while still containing the examples in the paper.

Tools to visualize the results of a classification of cases.
The graphical displays include stacked plots, silhouette plots, quasi residual plots, and class maps.
Implements the techniques described and illustrated in Raymaekers, Rousseeuw and Hubert (2021), Class maps for visualizing classification results, Technometrics, appeared online.
<doi:10.1080/00401706.2021.1927849> (open access) and Raymaekers and Rousseeuw (2021),
Silhouettes and quasi residual plots for neural nets and tree-based classifiers,
<arXiv:2106.08814>. Examples can be found in the vignettes:
"Discriminant_analysis_examples","K_nearest_neighbors_examples",
"Support_vector_machine_examples", "Rpart_examples", "Random_forest_examples",
and "Neural_net_examples".

Jakob Raymaekers

classmap

Visualizing Classification Results

Peter Rousseeuw

data_bookReviews function

A data frame with 1000 observations on the following 2 variables.<dl>
 <dt><code>review</code></dt>
<dd>the review in text format (character)</dd> <dt><code>sentiment</code></dt>
<dd>factor indicating the sentiment of the review: negative (1) or positive (2)</dd> 
</dl>

Format

Amazon book reviews data — data_bookReviews

A data frame with 1000 observations on the following 2 variables.<dl>
 <dt><code>review</code></dt>
<dd>the review in text format (character)</dd>

 <dt><code>sentiment</code></dt>
<dd>factor indicating the sentiment of the review: negative (1) or positive (2)</dd>

 
</dl>

data_bookReviews: Amazon book reviews data

Description

Usage

Arguments

Format

Examples