Learn R Programming

rrBLUP (version 3.2)

rrBLUP-package: Ridge regression and other kernels for genomic selection

Description

This package has been designed for both genomic selection and association mapping. Some details of using the package for genomic selection have been published in the Plant Genome. The heart of the package is the function mixed.solve, which can be used to model marker effects as random effects or the genotypic values of the lines as random effects. In the latter case, the function A.mat is useful for calculating the additive relationship matrix and the prediction of breeding values. To include epistatic effects in the genotypic value predictions, use the function kinship.BLUP.

Arguments

Missing data

A number of improvements have been made concerning the handling of missing data since the original publication of the package. Both mixed.solve and kinship.BLUP will tolerate missing phenotypic data: those observations are simply not used. In addition, the package now has several features for handling missing genotypic data. For breeding value predictions, the function A.mat will calculate the additive relationship matrix using pairwise complete observations. This provides an unbiased estimate of the true A matrix but the calculation is more time consuming than if one imputes before calculating A. For small amounts of missing data, using impute to fill in with the population mean is a good option. For large amounts of missing data, I recommend using A.mat without imputing even though it takes longer. The function kinship.BLUP tolerates missing genotypic data for any of the three kernels.

Association mapping

Use function GWA for association mapping.

Parallel computing

For Mac, Linux, and UNIX users, R package multicore can be used in conjunction with rrBLUP to take advantage of multiple processors on a single machine. For large data sets, especially when there is missing data, I highly recommend trying out this feature, which is available with kinship.BLUP, A.mat, and GWA. You need R >= 2.14.1 for this to work properly, and you must also use R from the command line (not the GUI).

References

Endelman, J.B. 2011. Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome 4:250-255.