Canonical correlation analysis (CCA) that scales to high-dimensional data. Uses covariance shrinkage and algorithmic speed-ups to run in time linear in p when p &gt; n.
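To make the role of covariance shrinkage concrete, here is a minimal base R sketch of regularized CCA. It is an illustration under stated assumptions, not the package's implementation: `shrink_cov()`, `inv_sqrt()` and `cca_sketch()` are hypothetical helpers, the shrinkage target (a scaled identity) and the fixed default values of `lambda.x` and `lambda.y` are assumptions, and this dense version costs O(p^3) rather than the linear time in p described above.

```r
# Sketch only: hypothetical helpers, not functions exported by the package.

# Shrink a sample covariance toward a scaled identity (assumed target)
shrink_cov <- function(S, lambda) {
  target <- diag(mean(diag(S)), nrow(S))
  (1 - lambda) * S + lambda * target
}

# Inverse symmetric square root via the eigen decomposition
inv_sqrt <- function(S) {
  e <- eigen(S, symmetric = TRUE)
  e$vectors %*% diag(1 / sqrt(e$values), length(e$values)) %*% t(e$vectors)
}

cca_sketch <- function(X, Y, k, lambda.x = 0.1, lambda.y = 0.1) {
  # Center columns so cross-products are covariances
  X <- scale(X, center = TRUE, scale = FALSE)
  Y <- scale(Y, center = TRUE, scale = FALSE)
  n <- nrow(X)

  Sxx <- shrink_cov(crossprod(X) / (n - 1), lambda.x)
  Syy <- shrink_cov(crossprod(Y) / (n - 1), lambda.y)
  Sxy <- crossprod(X, Y) / (n - 1)

  # Whiten each block, then take the SVD of the whitened cross-covariance
  Wx <- inv_sqrt(Sxx)
  Wy <- inv_sqrt(Syy)
  dcmp <- svd(Wx %*% Sxy %*% Wy, nu = k, nv = k)

  list(cor = dcmp$d[seq_len(k)],     # regularized canonical correlations
       x.coefs = Wx %*% dcmp$u,      # p1 x k coefficients for X
       y.coefs = Wy %*% dcmp$v)      # p2 x k coefficients for Y
}

set.seed(1)
n <- 50
X <- matrix(rnorm(n * 5), n, 5)
Y <- X[, 1:3] + matrix(rnorm(n * 3), n, 3)
fit <- cca_sketch(X, Y, k = 2)
fit$cor
```

Whitening each block before the SVD is what turns CCA into a singular value problem; the shrinkage keeps both covariance estimates invertible even when a block has more columns than rows.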

internal

Data whitening is a widely used preprocessing step that removes correlation structure, since statistical models often assume independence. Here we use a probabilistic model of the observed data to apply a whitening transformation. This Gaussian Inverse-Wishart Empirical Bayes model substantially reduces computational complexity and regularizes the eigenvalues of the sample covariance matrix to improve out-of-sample performance.
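The following base R sketch shows one way a whitening transformation with regularized eigenvalues can be applied without ever forming the p x p covariance matrix. It is a sketch under assumptions rather than the package's code: `whiten_lowrank()` is a hypothetical helper, the shrinkage weight `lambda` is fixed by hand instead of being estimated by empirical Bayes, and the shrinkage target is assumed to be the identity scaled by the mean variance.

```r
# Sketch only: whiten_lowrank() is a hypothetical helper, not a package function.
# Shrunk covariance (assumed form): Sigma = (1 - lambda) * S + lambda * nu * I
whiten_lowrank <- function(X, lambda) {
  stopifnot(lambda > 0, lambda <= 1)
  X <- scale(X, center = TRUE, scale = FALSE)
  n <- nrow(X)

  dcmp <- svd(X)                 # X = U D V^T, with at most min(n, p) components
  ev <- dcmp$d^2 / (n - 1)       # eigenvalues of the sample covariance S
  nu <- sum(ev) / ncol(X)        # mean eigenvalue of S (equals the mean variance)

  a <- 1 / sqrt((1 - lambda) * ev + lambda * nu)  # scaling inside span(V)
  b <- 1 / sqrt(lambda * nu)                      # scaling in the orthogonal complement

  # X %*% Sigma^(-1/2) using only n x p and p x min(n, p) matrices
  XV <- X %*% dcmp$v
  sweep(XV, 2, a - b, "*") %*% t(dcmp$v) + b * X
}

set.seed(1)
n <- 20
p <- 100
lambda <- 0.2
X <- matrix(rnorm(n * p), n, p)
Z <- whiten_lowrank(X, lambda)
dim(Z)
```

Because the shrunk covariance is a low-rank matrix plus a multiple of the identity, its inverse square root can be applied through the SVD of X alone, which is why the cost grows linearly in p when p is larger than n.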

Gabriel E Hoffman

decorrelate

Decorrelation Projection Scalable to High Dimensional Data

Gabriel Hoffman

cca function

<dl><dt>X</dt>
<dd>first matrix (n x p1)</dd>
<dt>Y</dt>
<dd>second matrix (n x p2)</dd>
<dt>k</dt>
<dd>number of canonical components to return</dd>
<dt>lambda.x</dt>
<dd>optional shrinkage parameter for estimating covariance of X. If NULL, estimate from data.</dd>
<dt>lambda.y</dt>
<dd>optional shrinkage parameter for estimating covariance of Y. If NULL, estimate from data.</dd></dl>
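As a hedged usage sketch of the arguments listed above: the call below assumes the package is installed under the name `decorrelate` and exports `cca()` with these argument names. The simulated data and dimensions are illustrative, and the structure of the returned object is not assumed here, so it is only inspected with `str()`.

```r
# Simulated inputs: X is n x p1 and Y is n x p2, with matching rows
set.seed(1)
n <- 100
X <- matrix(rnorm(n * 200), n, 200)
Y <- matrix(rnorm(n * 150), n, 150)

# Assumption: the package is installed and named "decorrelate"
if (requireNamespace("decorrelate", quietly = TRUE)) {
  # Leaving lambda.x and lambda.y as NULL asks for them to be estimated from data
  fit <- decorrelate::cca(X, Y, k = 3, lambda.x = NULL, lambda.y = NULL)
  str(fit, max.level = 1)
}
```

Passing a numeric value for `lambda.x` or `lambda.y` instead of `NULL` fixes the shrinkage strength for that matrix rather than estimating it.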

Arguments

Canonical correlation analysis — cca


cca: Canonical correlation analysis

Description

Usage

Value

Arguments

Details