- gData
An object of class gData
containing at least map
,
markers
and pheno
.
- traits
A vector of traits on which to run GWAS. These can be either
numeric indices or character names of columns in pheno
. If NULL
,
GWAS is run on all traits.
- trials
A vector of trials on which to run GWAS. These can
be either numeric indices or character names of list items in pheno
.
If NULL
, GWAS is run for all trials. GWAS is run for the
selected trials in sequential order.
- covar
An optional vector of covariates taken into account when
running GWAS. These can be either numeric indices or character names of
columns in covar
in gData
. If NULL
no covariates are
used.
- snpCov
An optional character vector of snps to be included as
covariates.
- kin
An optional kinship matrix or list of kinship matrices. These
matrices can be from the matrix
class as defined in the base package
or from the dsyMatrix
class, the class of symmetric matrices in the
Matrix package.
If GLSMethod
= "single" then one matrix should be provided, if
GLSMethod
= "multi", a list of chromosome specific matrices of length
equal to the number of chromosomes in map
in gData
.
If NULL
then matrix kinship
in gData
is used.
If both kin
is provided and gData
contains a matrix
kinship
then kin
is used.
- kinshipMethod
An optional character indicating the method used for
calculating the kinship matrix(ces). Currently "astle" (Astle and Balding,
2009), "IBS" and "vanRaden" (VanRaden, 2008) are supported. If a
kinship matrix is supplied either in gData
or in parameter kin
,
kinshipMethod
is ignored.
- remlAlgo
A character string indicating the algorithm used to estimate
the variance components. Either EMMA
, for the EMMA algorithm,
NR
, for the Newton-Raphson algorithm (using
sommer::mmes), or Hend
for the modified Henderson algorithm
(using LMMsolver::LMMsolve).
- GLSMethod
A character string indicating the method used to estimate
the marker effects. Either single
for using a single kinship matrix,
or multi
for using chromosome specific kinship matrices.
- useMAF
Should the minor allele frequency be used for selecting SNPs
for the analysis. If FALSE
, the minor allele count is used instead.
- MAF
The minor allele frequency (MAF) threshold used in GWAS. A
numerical value between 0 and 1. SNPs with MAF below this value are not taken
into account in the analysis, i.e. p-values and effect sizes are put to
missing (NA
). Ignored if useMAF
is FALSE
.
- MAC
A numerical value. SNPs with minor allele count below this value
are not taken into account for the analysis, i.e. p-values and effect sizes
are set to missing (NA
). Ignored if useMAF
is TRUE
.
- genomicControl
Should genomic control correction as in Devlin and
Roeder (1999) be applied?
- thrType
A character string indicating the type of threshold used for
the selection of candidate loci. See Details.
- alpha
A numerical value used for calculating the LOD-threshold for
thrType
= "bonf" and the significant p-Values for thrType
=
"fdr".
- LODThr
A numerical value used as a LOD-threshold when
thrType
= "fixed".
- nSnpLOD
A numerical value indicating the number of SNPs with the
smallest p-values that are selected when thrType
= "small".
- pThr
A numerical value just as the cut off value for p-Values for
thrType
= "fdr".
- rho
A numerical value used a the minimum value for SNPs to be
considered correlated when using thrType
= "fdr".
- sizeInclRegion
An integer. Should the results for SNPs close to
significant SNPs be included? If so, the size of the region in centimorgan
or base pairs. Otherwise 0.
- minR2
A numerical value between 0 and 1. Restricts the SNPs included
in the region close to significant SNPs to only those SNPs that are in
sufficient Linkage Disequilibrium (LD) with the significant snp, where LD
is measured in terms of \(R^2\). If for example sizeInclRegion
=
200000 and minR2
= 0.5, then for every significant SNP also those SNPs
whose LD (\(R^2\)) with the significant SNP is at least 0.5 AND which are
at most 200000 away from this significant snp are included. Ignored if
sizeInclRegion
= 0.
- nCores
A numerical value indicating the number of cores to be used by
the parallel part of the algorithm. If NULL
the number of cores used
will be equal to the number of cores available on the machine - 1.