samQL: Training function of Sparse Additive Models

Description

The regression model is learned using training data.

Usage

samQL(X, y, p = 3, lambda = NULL, nlambda = NULL, 
lambda.min.ratio = 5e-3, thol = 1e-05, max.ite = 1e5)

Arguments

The n by d design matrix of the training set, where n is sample size and d is dimension.

The n-dimensional response vector of the training set, where n is sample size.

The number of baisis spline functions. The default value is 3.

lambda

A user supplied lambda sequence. Typical usage is to have the program compute its own lambda sequence based on nlambda and lambda.min.ratio. Supplying a value of lambda overrides this. WARNING: use with care. Do not supply a single value for lambda. Suppl

nlambda

The number of lambda values. The default value is 30.

lambda.min.ratio

Smallest value for lambda, as a fraction of lambda.max, the (data derived) entry value (i.e. the smallest value for which all coefficients are zero). The default is 5e-3.

thol

Stopping precision. The default value is 1e-5.

max.ite

The number of maximum iterations. The default value is 1e5.

Value

pThe number of baisis spline functions used in training.
X.minA vector with each entry corresponding to the minimum of each input variable. (Used for rescaling in testing)
X.ranA vector with each entry corresponding to the range of each input variable. (Used for rescaling in testing)
lambdaA sequence of regularization parameter used in training.
wThe solution path matrix (d*p by length of lambda) with each column corresponding to a regularization parameter. Since we use the basis expansion, the length of each column is d*p.
interceptThe solution path of the intercept.
dfThe degree of freedom of the solution path (The number of non-zero component function)
func_normThe functional norm matrix (d by length of lambda) with each column corresponds to a regularization parameter. Since we have d input variabls, the length of each column is d.
sseSums of square errors of the solution path.

Details

We adopt various computational algorithms including the block coordinate descent, fast iterative soft-thresholding algorithm, and newton method. The computation is further accelerated by "warm-start" and "active-set" tricks.

References

P. Ravikumar, J. Lafferty, H.Liu and L. Wasserman. "Sparse Additive Models", Journal of Royal Statistical Society: Series B, 2009. T. Zhao and H.Liu. "Sparse Additive Machine", International Conference on Artificial Intelligence and Statistics, 2012.

Examples

Run this code

## generating training data
n = 100
d = 500
X = 0.5*matrix(runif(n*d),n,d) + matrix(rep(0.5*runif(n),d),n,d)

## generating response
y = -2*sin(X[,1]) + X[,2]^2-1/3 + X[,3]-1/2 + exp(-X[,4])+exp(-1)-1

## Training
out.trn = samQL(X,y)
out.trn

## plotting solution path
plot(out.trn)

## generating testing data
nt = 1000
Xt = 0.5*matrix(runif(nt*d),nt,d) + matrix(rep(0.5*runif(nt),d),nt,d)

yt = -2*sin(Xt[,1]) + Xt[,2]^2-1/3 + Xt[,3]-1/2 + exp(-Xt[,4])+exp(-1)-1

## predicting response
out.tst = predict(out.trn,Xt)

Run the code above in your browser using DataLab