Learn R Programming

SAM (version 1.0.2)

spamdata: SPAM Detection Dataset

Description

This data set consists of 4601 emails (1813 spam and 2788 regular).

Usage

data(spamdata)

Arguments

format

The format is a list containing conatins a matrix and a vector. 1. fea - 4601x57: 4601 emails with 57 word counts. 2. lab - 4601x1: "1" denotes "spam" and "-1" denotes "regular".

source

It is publicly available at http://archive.ics.uci.edu/ml/datasets/Spambase

Details

This data set can be used to perform the sparse additive machine to demonstrate the supriority of its non-linearity over L1 norm SVM.

References

T. Zhao and H.Liu. "Sparse Additive Machine", International Conference on Artificial Intelligence and Statistics, 2012. P. Bradley and O. Mangasarian. ""Feature selection via concaveminimization and support vector machines"", International Conference on Machine Learing, 1998.

See Also

spam,l1svm

Examples

Run this code
data(spamdata)
dim(spamdata$fea)
sum(spamdata$lab==1)

Run the code above in your browser using DataLab