- sumstat
matrix of summary data, details refer to https://cnsgenomics.com/software/gcta/#COJO.
- ldm
dense or sparse matrix, ld for reference panel (m * m, m is the number of SNPs). NOTE that the order of SNPs should be consistent with summary data.
- method
bayes methods including: "BayesB", "BayesA", "BayesL", "BayesRR", "BayesBpi", "BayesC", "BayesCpi", "BayesR", "CG".
"BayesRR": Bayes Ridge Regression, all SNPs have non-zero effects and share the same variance, equals to RRBLUP or GBLUP.
"BayesA": all SNPs have non-zero effects, and take different variance which follows an inverse chi-square distribution.
"BayesB": only a small proportion of SNPs (1-Pi) have non-zero effects, and take different variance which follows an inverse chi-square distribution.
"BayesBpi": the same with "BayesB", but 'Pi' is not fixed.
"BayesC": only a small proportion of SNPs (1-Pi) have non-zero effects, and share the same variance.
"BayesCpi": the same with "BayesC", but 'Pi' is not fixed.
"BayesL": BayesLASSO, all SNPs have non-zero effects, and take different variance which follows an exponential distribution.
"BayesR": only a small proportion of SNPs have non-zero effects, and the SNPs are allocated into different groups, each group has the same variance.
"CG": conjugate gradient algorithm with assigned lambda.
- map
(optional, only for GWAS) the map information of genotype, at least 3 columns are: SNPs, chromosome, physical position.
- Pi
vector, the proportion of zero effect and non-zero effect SNPs, the first value must be the proportion of non-effect markers.
- lambda
value or vector, the ridge regression value for each SNPs.
- fold
percentage of variance explained for groups of SNPs, the default is c(0, 0.0001, 0.001, 0.01).
- niter
the number of MCMC iteration.
- nburn
the number of iterations to be discarded.
- thin
the number of thinning after burn-in. Note that smaller thinning frequency may have higher accuracy of estimated parameters, but would result in more memory for collecting process, on contrary, bigger frequency may have negative effect on accuracy of estimations.
- windsize
window size in bp for GWAS, the default is 1e6.
- windnum
fixed number of SNPs in a window for GWAS, if it is specified, 'windsize' will be invalid, the default is NULL.
- vg
prior value of genetic variance.
- dfvg
the number of degrees of freedom for the distribution of genetic variance.
- s2vg
scale parameter for the distribution of genetic variance.
- ve
prior value of residual variance.
- dfve
the number of degrees of freedom for the distribution of residual variance.
- s2ve
scale parameter for the distribution of residual variance.
- printfreq
frequency of collecting the estimated parameters and printing on console. Note that smaller frequency may have higher accuracy of estimated parameters, but would result in more time and memory for collecting process, on contrary, bigger frequency may have an negative effect on accuracy of estimations.
- seed
seed for random sample.
- threads
number of threads used for OpenMP.
- verbose
whether to print the iteration information on console.