The Spambase data set was created by Mark Hopkins, Erik Reeber, George Forman,
and Jaap Suermondt at Hewlett-Packard Labs. It includes 4601 observations
corresponding to email messages, 1813 of which are spam. From the original
email messages, 58 different attributes were computed.
Usage
data(spambase)
Source
This data set is from the UCI Machine Learning Repository. You can find more information
about this data set, including the citation policy, from
http://archive.ics.uci.edu/ml/datasets/Spambase
Details
This data is used as an example in the book "R in a Nutshell," from O'Reilly Media.
Examples
data(spambase)
# count the non-spam and spam messages
table(spambase$is_spam)
# fit a linear discriminant analysis model to the data
library(MASS)
spam.lda <- lda(formula=is_spam~., data=spambase)
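A fitted discriminant model can be checked against the known labels with a confusion table. The following is a minimal sketch, not part of the original example. It assumes that the data set is loaded from a package named nutshell (an assumption; this page does not name the package), and the object names spam.fit, spam.pred, and confusion are illustrative only.

```r
# Sketch only: assumes the 'nutshell' package (not named on this page)
# is installed and provides the spambase data frame.
library(MASS)
library(nutshell)
data(spambase)

# Fit LDA with is_spam as the response and all other columns as predictors.
spam.fit <- lda(is_spam ~ ., data = spambase)

# Predict on the training data itself; this is an optimistic check,
# since no held-out test set is used here.
spam.pred <- predict(spam.fit, newdata = spambase)$class

# Rows: predicted class; columns: actual class.
confusion <- table(predicted = spam.pred, actual = spambase$is_spam)
print(confusion)
```

Because the predictions are made on the same rows used for fitting, the table is likely to overstate how well the model would do on new messages.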