spamdata: SPAM Detection Dataset
Description
This data set consists of 4601 emails (1813 spam and 2788 regular).
Format
The data is a list containing a matrix and a vector.
1. fea - 4601x57: 4601 emails with 57 word counts.
2. lab - 4601x1: "1" denotes "spam" and "-1" denotes "regular".
Source
It is publicly available at http://archive.ics.uci.edu/ml/datasets/Spambase
Details
This data set can be used with the sparse additive machine to demonstrate the superiority of its non-linearity over the L1-norm SVM.
References
T. Zhao and H. Liu. "Sparse Additive Machine", International Conference on Artificial Intelligence and Statistics, 2012.
P. Bradley and O. Mangasarian. "Feature selection via concave minimization and support vector machines", International Conference on Machine Learning, 1998.
Examples
data(spamdata)
dim(spamdata$fea)
sum(spamdata$lab==1)
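
The layout described under Format (a list holding a feature matrix `fea` and a label vector `lab` coded as 1 and -1) can be sanity-checked with a small helper. The sketch below is only illustrative: `summarise_spam_list` and the `toy` list are hypothetical names introduced here, not part of any package, and the toy data merely mimics the documented layout so that the code runs without the real data set.

```r
# Hypothetical helper (not part of any package): summarise a list that is
# laid out like spamdata, i.e. a matrix `fea` and labels `lab` in {1, -1}.
summarise_spam_list <- function(d) {
  stopifnot(is.list(d), is.matrix(d$fea))
  lab <- as.vector(d$lab)  # lab may be stored as an n x 1 matrix
  stopifnot(nrow(d$fea) == length(lab), all(lab %in% c(-1, 1)))
  c(emails   = nrow(d$fea),
    features = ncol(d$fea),
    spam     = sum(lab == 1),
    regular  = sum(lab == -1))
}

# Toy stand-in with the same layout (assumption: made-up values, 5 emails, 3 features).
toy <- list(fea = matrix(0, nrow = 5, ncol = 3),
            lab = c(1, -1, -1, 1, -1))
toy_summary <- summarise_spam_list(toy)
print(toy_summary)
```

If the real data matches the description above, `summarise_spam_list(spamdata)` should report 4601 emails, 57 features, 1813 spam and 2788 regular messages.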