amltest
and then forming a matrix including both main marker effects and two-way epistatic effects.
epigen(response, marker, kin, numkeep=floor(length(response)*.5), selectvar, corbnd=0.5, mafb=0.04, method="complete")
amltest
.cleanclust
can be used to re-encode it. The function cleanclust
should also be used to preprocess the marker data to remove marker with a high proportion of missing values or very low minor allele frequency as well as impute missing values with the sample mean. It is also recommend that cleanclust
be used to filter the markers so that no markers are highly correlated. It is passed to amltest
.marker
. It is passed to amltest
. amltest
. It should be less than the number of lines. The default value is a half of the number of lines. amltest
. The number of main marker effects that are retained could be slightly less than selectvar
. See the documentation for amltest
.cleanclust
.cleanclust
.hclust
. The values could be one of "complete", "average" or "single". The default is "complete". See the documentation for cleanclust
. effects
, or the marker name for a main effect.effects
, or the marker name for a main effect.amltest
is called first to select a subset of markers with the most significant main effects. Then two-way epistatic effects are formed from these selected markers by taking the product of the two columns corresponding to each pair of markers. Subsequently, the cleanclust
function is called to remove effects with very low mean values and also filter the effects such that no two effects are highly correlated. The resulted genetic effect matrix include both main effects and epistatic effects. It can then be used as input for amltest
in the same manner as a marker matrix.
Wang, D., Eskridge, K.M. and Crossa, J. (2011) Identifying QTLs and Epistasis in Structured Plant Populations Using Adaptive Mixed LASSO. Journal of Agricultural, Biological, and Environmental Statistics, 16:170-184.
Wang, D., et al. (2012) Prediction of genetic values of quantitative traits with epistatic effects in plant breeding populations. Heredity, 109: 313-319.
amltest
, cleanclust
.
## process the markers in the wheat data set.
data("wheat")
clmarker<- cleanclust(wheat$marker, nafrac=0.2, mafb=0.1, corbnd=0.5, method="complete")
intermat <- epigen(wheat$y, clmarker$newmarker, wheat$A, numkeep=100, selectvar=30,
corbnd=0.5, mafb=0.04)
Run the code above in your browser using DataLab