The function to calculate allele median ratios, proportion of heterozygotes and allele probability values under different assumptions (see details), and their chi-square significance values for duplicate detection
allele.info(
X,
x.norm = NULL,
Fis,
method = c("MedR", "QN", "pca", "TMM", "TMMex"),
logratioTrim = 0.3,
sumTrim = 0.05,
Weighting = TRUE,
Acutoff = -1e+10,
plot.allele.cov = TRUE,
verbose = TRUE,
parallel = FALSE,
...
)
Returns a data frame of median allele ratio, proportion of heterozygotes, number of heterozygotes, and allele probability at different assumptions with their chi-square significance
allele depth table generated from the function
hetTgen
(non-normalized)
a data frame of normalized allele coverage, output of
cpm.normal
. If not provided, calculated using X
.
numeric. Inbreeding coefficient calculated using h.zygosity()
function
character. method to be used for normalization
(see cpm.normal
details). Default TMM
numeric. percentage value (0 - 1) of variation to be trimmed in log transformation
numeric. amount of trim to use on the combined absolute
levels (“A” values) for method TMM
logical, whether to compute (asymptotic binomial precision) weights
numeric, cutoff on “A” values to use before trimming
logical, plot comparative plots of allele depth coverage in homozygotes and heterozygotes
logical, whether to print progress
logical. whether to parallelize the process
further arguments to be passed to plot
Piyal Karunarathne, Pascal Milesi, Klaus Schliep
Allele information generated here are individual SNP based and presents the proportion of heterozygotes, number of samples, and deviation of allele detection from a 1:1 ratio of reference and alternative alleles. The significance of the deviation is tested with Z-score test \(Z = \frac{ \frac{N}{2}-N_A}{ \sigma_{x}}\), and chi-square test (see references for more details on the method).
McKinney, G. J., Waples, R. K., Seeb, L. W., & Seeb, J. E. (2017). Paralogs are revealed by proportion of heterozygotes and deviations in read ratios in genotyping by sequencing data from natural populations. Molecular Ecology Resources, 17(4)
Karunarathne et al. 2022 (to be added)
if (FALSE) data(ADtable)
hz<-h.zygosity(vcf,verbose=FALSE)
Fis<-mean(hz$Fis,na.rm = TRUE)
AI<-allele.info(ADtable,x.norm=ADnorm,Fis=Fis)
Run the code above in your browser using DataLab