mixed.solve: Mixed-model solver

Description

Calculates maximum-likelihood (ML/REML) solutions for mixed models of the form $$y = X \beta + Z u + \varepsilon$$ where $\beta$ is a vector of fixed effects and $u$ is a vector of random effects with $Var[u] = K \sigma^2_u$. The residual variance is $Var[\varepsilon] = I \sigma^2_e$. This class of mixed models, in which there is a single variance component other than the residual error, has a close relationship with ridge regression (ridge parameter $\lambda = \sigma_e^2 / \sigma^2_u$).

Usage

mixed.solve(y, Z=NULL, K=NULL, X=NULL, method="REML", 
        bounds=c(1e-09, 1e+09), SE=FALSE, return.Hinv=FALSE)

Arguments

Vector ($n \times 1$) of observations. Missing values (NA) are omitted, along with the corresponding rows of X and Z.

Design matrix ($n \times m$) for the random effects. If not passed, assumed to be the identity matrix.

Covariance matrix ($m \times m$) for random effects; must be positive semi-definite. If not passed, assumed to be the identity matrix.

Design matrix ($n \times p$) for the fixed effects. If not passed, a vector of 1's is used to model the intercept.

method

Specifies whether the full ("ML") or restricted ("REML") maximum-likelihood method is used.

bounds

Array with two elements specifying the lower and upper bound for the ridge parameter.

If TRUE, standard errors are calculated.

return.Hinv

If TRUE, the function returns the inverse of $H = Z K Z' + \lambda I$. This is useful for GWA.

Value

If SE=FALSE, the function returns a list containing [object Object],[object Object],[object Object],[object Object],[object Object] If SE=TRUE, the list also contains [object Object],[object Object] If return.Hinv=TRUE, the list also contains [object Object]

Details

This function can be used to predict marker effects or breeding values (see examples). The numerical method is based on the spectral decomposition of $Z K Z'$ and $S Z K Z' S$, where $S = I - X (X' X)^{-1} X'$ is the projection operator for the nullspace of $X$ (Kang et al., 2008). For marker effects where K = I, the function will run faster if K is not passed than if the user passes the identity matrix.

References

Kang et al. 2008. Efficient control of population structure in model organism association mapping. Genetics 178:1709-1723. Endelman, J.B. 2011. Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome 4:250-255.

Examples

Run this code

#random population of 200 lines with 1000 markers
G <- matrix(rep(0,200*1000),200,1000)
for (i in 1:200) {
  G[i,] <- ifelse(runif(1000)<0.5,-1,1)
}

#random phenotypes
u <- rnorm(1000)
g <- as.vector(crossprod(t(G),u))
h2 <- 0.5  #heritability
y <- g + rnorm(200,mean=0,sd=sqrt((1-h2)/h2*var(g)))

#predict marker effects
ans <- mixed.solve(y,Z=G)  #By default K = I
accuracy <- cor(u,ans$u)

#predict breeding values
ans <- mixed.solve(y,K=A.mat(G))
accuracy <- cor(g,ans$u)

Run the code above in your browser using DataLab