amltest(response, marker, kin, numkeep=floor(length(response)*.5), selectvar)
cleanclust
can be used to re-encode it. The function cleanclust
should also be used to preprocess the marker data to remove marker with a high proportion of missing values or very low minor allele frequency as well as impute missing values with the sample mean. It is also recommend that cleanclust
be used to filter the markers so that no markers are highly correlated. marker
. selectvar
. See Details.marker
is included in the model fit and the second column is the effect for each marker in the model.estimate
.estimate
.estimate
.amltest
first performs a preliminary screening to retain a set of markers (predictors) numbering at most numkeep
, which should be less than the number of lines. This step relies on LASSO fitting using lars. The quantity numkeep
is the maximum steps of iterations in LASSO fit. Due to the nature of the lars algorithm, the number of markers retained after the screening might be slightly less than numkeep.
Then amltest
will perform adaptive mixed LASSO fit by iteratively estimating the fixed effects and random effects up to the number of iterations defined by selectvar
. Again, the number of markers in the output might be slightly less than selectvar
as determined by the behavior of the lars algorithm. So if an exact number of markers are required in the model, some trial and error might be needed.
Wang, D., et al. (2012) Prediction of genetic values of quantitative traits with epistatic effects in plant breeding populations. Heredity, 109: 313-319.
cleanclust
.
## analyze the wheat data with main marker effects.
data("wheat")
clmarker<- cleanclust(wheat$marker, nafrac=0.2, mafb=0.1, corbnd=0.5, method="complete")
resmain <- amltest(wheat$y, clmarker$newmarker, wheat$A, numkeep=80, selectvar=40)
Run the code above in your browser using DataLab