flare.slim: Sparse Linear Regression using Non-smooth Loss Functions and L1 Regularization

Description

The function "flare.slim" implements a family of Lasso variants for estimating high dimensional sparse linear models including Dantzig Selector, LAD Lasso, SQRT Lasso for estimating high dimensional sparse linear model. We adopt the combination of the dual smoothing and monotone fast iterative soft-thresholding algorithm. The computation is memory-optimized using the sparse matrix output.

Usage

flare.slim(X, Y, lambda = NULL, nlambda = NULL, lambda.min.ratio = NULL,
           method="lq", q = 2, prec = 1e-4, max.ite = 1e4, mu = 0.01,
           intercept = TRUE, verbose = TRUE)

Arguments

The $n$ dimensional response vector.

The $n$ by $d$ design matrix.

lambda

A sequence of decresing positive value to control the regularization. Typical usage is to leave the input lambda = NULL and have the program compute its own lambda sequence based on nlambda and lambda.min.ratio

nlambda

The number of values used in lambda. Default value is 5.

lambda.min.ratio

The smallest value for lambda, as a fraction of the uppperbound (MAX) of the regularization parameter. The program can automatically generate lambda as a sequence of length = nlambda starting from

method

Dantzig selector is applied if method = "dantzig" and $L_q$ Lasso is applied if method = "lq". The default value is "lq".

The loss function used in Lq Lasso. It is only applicable when method = "lq" and must be either 1 or 2. The default value is 2.

prec

Stopping criterion. The default value is 1e-4.

max.ite

The iteration limit. The default value is 1e4.

The smoothing parameter. The default value is 0.01.

intercept

Whether the intercept is included in the model. The defulat value is TRUE.

verbose

Tracing information printing is disabled if verbose = FALSE. The default value is TRUE.

Value

An object with S3 class "flare.slim" is returned:
betaA matrix of regression estimates whose columns correspond to regularization parameters.
interceptThe value of intercepts corresponding to regularization parameters.
YThe value of Y used in the program.
XThe value of X used in the program.
lambdaThe sequence of regularization parameters lambda used in the program.
nlambdaThe number of values used in lambda.
methodThe method from the input.
sparsityThe sparsity levels of the solution path.
iteA list of vectors where ite[[1]] is the number of external iteration and ite[[2]] is the number of internal iteration with the i-th entry corresponding to the i-th regularization parameter.
verboseThe verbose from the input.

Details

Dantzig selector solves the following optimization problem $$\min || \beta ||_1, \quad \textrm{s.t. } || X'(Y - X \beta) ||_{\infty} < \lambda$$ $L_q$ loss Lasso solves the following optimization problem $$\min n^{-\frac{1}{q}}|| Y - X \beta ||_q + \lambda || \beta ||_1$$ where $1

References

1. A. Belloni, V. Chernozhukov and L. Wang. Pivotal recovery of sparse signals via conic programming. Biometrika, 2012. 2. L. Wang. L1 penalized LAD estimator for high dimensional linear regression. Journal of Multivariate Analysis, 2013. 3. E. Candes and T. Tao. The Dantzig selector: Statistical estimation when p is much larger than n. Annals of Statistics, 2007. 4. A. Beck and M. Teboulle. Fast gradient-based algorithms for constrained total variation image denoising and deblurring problems. IEEE Transactions on Image Processing, 2009. 5. B. He and X. Yuan. On non-ergodic convergence rate of Douglas-Rachford alternating direction method of multipliers. Technical Report, 2012. 6. J. Liu and J. Ye. Efficient L1/Lq Norm Regularization. Technical Report, 2010.

Examples

Run this code

## Generate the design matrix and regression coefficient vector
n = 200
d = 400
X = matrix(rnorm(n*d), n, d)
beta = c(3,2,0,1.5,rep(0,d-4))

## Generate response using Gaussian noise, and fit a sparse linear model using SQRT Lasso
eps.sqrt = rnorm(n)
Y.sqrt = X%*%beta + eps.sqrt
out.sqrt = flare.slim(X = X, Y = Y.sqrt, lambda = seq(0.8,0.2,length.out=5))

## Generate response using Cauchy noise, and fit a sparse linear model using LAD Lasso
eps.lad = rt(n = n, df = 1)
Y.lad = X%*%beta + eps.lad
out.lad = flare.slim(X = X, Y = Y.lad, q = 1, lambda = seq(0.5,0.2,length.out=5))

## Visualize the solution path
plot(out.sqrt)
plot(out.lad)

Run the code above in your browser using DataLab