Learn R Programming

nutshell (version 1.02)

spambase: Spambase Data Set

Description

The Spambase data set was created by Mark Hopkins, Erik Reeber, George Forman, and Jaap Suermondt at Hewlett-Packard Labs. It includes 4601 observations corresponding to email messages, 1813 of which are spam. From the original email messages, 58 different attributes were computed.

Usage

data(spambase)

Arguments

source

This data set is from the UCI Machine Learning Repository. You can find more information about this data set, including the ciation policy, from http://archive.ics.uci.edu/ml/datasets/Spambase

Details

This data is used as an example in the book "R in a Nutshell," from O'Reilly Media.

Examples

Run this code
data(spambase)
table(spambase$is_spam)
# fit a linear disciminant analysis model to the data
library(MASS)
spam.lda <- qda(formula=is_spam~., data=spambase)

Run the code above in your browser using DataLab