Usage
Matrix_eQTL_main(
snps,
gene,
cvrt = SlicedData$new(),
output_file_name = "",
pvOutputThreshold = 1e-5,
useModel = modelLINEAR,
errorCovariance = numeric(),
verbose = TRUE,
output_file_name.cis = "",
pvOutputThreshold.cis = 0,
snpspos = NULL,
genepos = NULL,
cisDist = 1e6,
pvalue.hist = FALSE)
Matrix_eQTL_engine(
snps,
gene,
cvrt = SlicedData$new(),
output_file_name,
pvOutputThreshold = 1e-5,
useModel = modelLINEAR,
errorCovariance = numeric(),
verbose = TRUE,
pvalue.hist = FALSE)
Arguments
snps
SlicedData
object with genotype information.
Can be real-valued for linear model and
should take up to 3 distinct values for ANOVA (see useModel
parameter).
gene
SlicedData
object with gene expression information.
Should have columns matching those of snps
.
cvrt
SlicedData
object with additional covariates.
Can be an empty SlicedData
object in case of no covariates.
The columns must match those in snps
and gene
.
output_file_name
character string with the name of the output file.
Significant (all or distant) associations are saved to this file.
If the file with this name exists, it will be overwritten.
output_file_name.cis
character string with the name of the output file.
Significant local associations are saved to this file.
If the file with this name exists, it will be overwritten.
pvOutputThreshold
numeric. Only gene-SNP pairs significant at this level will be saved in output_file_name
.
pvOutputThreshold.cis
Same as pvOutputThreshold
, but for local eQTLs.
If both thresholds are positive, pvOutputThreshold
determines cut-off for distant (trans) eQTLs.
useModel
numeric. Can be modelLINEAR
, modelANOVA
, or modelLINEAR_CROSS
.
See the section above for description.
errorCovariance
numeric. The error covariance matrix. Use numeric()
for homoskedastic independent errors.
verbose
logical. Set to TRUE
to display detailed report on the progress.
snpspos
data.frame
object with information about SNP locations, must have 3 columns - SNP name, chromosome, and position.
genepos
data.frame
with information about transcript locations, must have 4 columns - the name, chromosome, and positions of the left and right ends.
cisDist
numeric. SNP-gene pairs within this distance are considered local. The distance is measured from the nearest end of the gene.
pvalue.hist
This parameter defines how the distribution of (all/local/distant) p-values is recorded.
If pvalue.hist
is FALSE
, the information is not recorded and thus the analysis is performed faster.
Set pvalue.hist = "qqplot"