Learn R Programming

flare (version 1.8)

sugm: High-dimensional Sparse Undirected Graphical Models

Description

The function "sugm" estimates sparse undirected graphical models (Gaussian precision matrices) in high dimensions. Two procedures are implemented using a column-wise regression scheme: (1) Tuning-Insensitive Graph Estimation and Regression based on square-root Lasso ("tiger"); and (2) The Constrained L1 Minimization for Sparse Precision Matrix Estimation ("clime"). The optimization algorithm is based on the alternating direction method of multipliers (ADMM), linearization, and multi-stage screening. Missing values can be tolerated for CLIME when the input is a data matrix. Computation is memory-optimized using sparse matrix output.

Usage

sugm(data, lambda = NULL, nlambda = NULL, lambda.min.ratio = NULL, 
     rho = NULL, method = "tiger", sym = "or", shrink = NULL, 
     prec = 1e-4, max.ite = 1e4, standardize = FALSE, 
     perturb = TRUE, verbose = TRUE)

Value

An object with S3 class "sugm" is returned:

data

The n by d data matrix or d by d sample covariance matrix from the input.

cov.input

An indicator of the sample covariance.

lambda

The sequence of regularization parameters lambda used in the program.

nlambda

The number of values used in lambda.

icov

A list of d by d precision matrices corresponding to regularization parameters.

sym

The sym from the input.

method

The method from the input.

path

A list of d by d adjacency matrices of estimated graphs as a graph path corresponding to lambda.

sparsity

The sparsity levels of the graph path.

ite

Iteration counts returned by the underlying optimization solver.

df

A d by nlambda matrix containing nonzero counts along the estimated path.

standardize

The standardize from the input.

perturb

The perturb from the input.

verbose

The verbose from the input.

Arguments

data

There are two options for "clime": (1) an n by d data matrix, or (2) a d by d sample covariance matrix. The input type is identified by symmetry. For "tiger", covariance input is not supported and d\( \ge \)3 is required. For "clime", d\( \ge \)2 is required.

lambda

A sequence of decreasing, positive, finite numbers controlling regularization. Typical usage is lambda = NULL, in which case the sequence is generated from nlambda and lambda.min.ratio.

nlambda

The number of values used in lambda. Default value is 5.

lambda.min.ratio

The minimum value of generated lambda as a fraction of lambda.max. The default value is 0.4 for both "tiger" and "clime".

rho

Penalty parameter used in the optimization algorithm. The default value is 1.

method

"tiger" is applied if method = "tiger" and "clime" is applied if method="clime". Default value is "tiger".

sym

Symmetrization of output graphs. If sym = "and", the edge between node i and node j is selected ONLY when both node i and node j are selected as neighbors for each other. If sym = "or", the edge is selected when either node i or node j is selected as the neighbor for each other. The default value is "or".

shrink

Shrinkage of the regularization parameter based on estimation precision. The default value is 0.

prec

Stopping criterion. The default value is 1e-4.

max.ite

The iteration limit. The default value is 1e4.

standardize

Variables are standardized to have mean zero and unit standard deviation if standardize = TRUE. The default value is FALSE.

perturb

For "clime", if TRUE, adds \(1/\sqrt{n}\) to the diagonal of Sigma; if FALSE, no perturbation is added; a numeric value can also be supplied directly. The default value is TRUE.

verbose

Tracing information printing is disabled if verbose = FALSE. The default value is TRUE.

Author

Xingguo Li, Tuo Zhao, Lie Wang, Xiaoming Yuan and Han Liu
Maintainer: Tuo Zhao <tourzhao@gatech.edu>

Details

CLIME solves the following minimization problem
$$ \min || \Omega ||_1 \quad \textrm{s.t. } || S \Omega - I ||_\infty \le \lambda, $$
where \(||\cdot||_1\) and \(||\cdot||_\infty\) are element-wise 1-norm and \(\infty\)-norm respectively.

"tiger" solves the following minimization problem
$$ \min ||X-XB||_{2,1} + \lambda ||B||_1 \quad \textrm{s.t. } B_{jj} = 0, $$
where \(||\cdot||_{1}\) and \(||\cdot||_{2,1}\) are element-wise 1-norm and \(L_{2,1}\)-norm respectively.

References

1. T. Cai, W. Liu and X. Luo. A constrained L1 minimization approach to sparse precision matrix estimation. Journal of the American Statistical Association, 2011.
2. H. Liu, L. Wang. TIGER: A tuning-insensitive approach for optimally estimating large undirected graphs. Technical Report, 2012.
3. B. He and X. Yuan. On non-ergodic convergence rate of Douglas-Rachford alternating direction method of multipliers. Technical Report, 2012.

See Also

flare-package, sugm.generator, sugm.select, sugm.plot, sugm.roc, plot.sugm, plot.select, plot.roc, plot.sim, print.sugm, print.select, print.roc and print.sim.

Examples

Run this code

## load package required
library(flare)

## generating data
n = 50
d = 50
D = sugm.generator(n=n,d=d,graph="band",g=1)
plot(D)

## sparse precision matrix estimation with method "clime"
out1 = sugm(D$data, method = "clime")
plot(out1)
sugm.plot(out1$path[[4]])

## sparse precision matrix estimation with method "tiger"
out2 = sugm(D$data, method = "tiger")
plot(out2)
sugm.plot(out2$path[[5]])

Run the code above in your browser using DataLab