spamdata

This data set consists of 4601 emails (1813 spam and 2788 regular).

The package SAM targets at high dimensional classification
        problem for complex data analysis. SAM is short for sparse
        additive machine, which is a explicit combination of high
        dimensional sparse additive modeling and support vector
        machine. Different from existing non-linear classification
        methods, which usually use, SAM adopts the computationally
        efficient basis spline technique. The optimization is solved by
        the Linearized Alternative Direction Method of Multipliers
        (L-ADMM). The computation is further accelerated by warm-start
        and active-set tricks. For users who are interested in
        large-scale problems, we also provide an implementation of L1
        norm SVM for computational convenience.

Tuo Zhao

Sparse Additive Machine

spamdata function

The format is a list containing conatins a matrix and a vector.
  1. fea - 4601x57: 4601 emails with 57 word counts.
  2. lab - 4601x1: "1" denotes "spam" and "-1" denotes "regular".

format

It is publicly available at http://archive.ics.uci.edu/ml/datasets/Spambase

spamdata: SPAM Detection Dataset

Description

Usage

Arguments

format

source

Details

References

See Also

Examples