updog Flexible Genotyping for PolyploidsImplements empirical Bayes approaches to genotype
polyploids from next generation sequencing data while
accounting for allelic bias, overdispersion, and sequencing
error. The main function is flexdog, which allows
the specification of many different genotype distributions.
An experimental function that takes into account varying
levels of relatedness is implemented in mupdog.
Also provided are functions to simulate genotypes
(rgeno) and read-counts
(rflexdog), as well as functions to calculate
oracle genotyping error rates (oracle_mis) and
correlation with the true genotypes (oracle_cor).
These latter two functions are useful for read depth calculations.
Run browseVignettes(package = "updog") in R
for example usage. The methods are described in detail in
Gerard et. al. (2018) and Gerard and Ferr<U+00E3>o (2019).
flexdogThe main function that fits an empirical Bayes approach to genotype polyploids from next generation sequencing data.
multidogA convenience function for running
flexdog over many SNPs. This function provides
support for parallel computing.
mupdogAn experimental approach to genotype autopolyploids that accounts for varying levels of relatedness between the individuals in the sample.
rgenosimulate the genotypes of a sample
from one of the models allowed in flexdog.
rflexdogSimulate read-counts from the
flexdog model.
plot.flexdogPlotting the output of
flexdog.
plot.mupdogPlotting the output of
mupdog.
oracle_jointThe joint distribution of the true genotype and an oracle estimator.
oracle_plotVisualize the output of oracle_joint.
oracle_misThe oracle misclassification error rate (Bayes rate).
oracle_corCorrelation between the true genotype and the oracle estimated genotype.
snpdatA small example dataset for using
flexdog.
uitdewilligenA small example dataset
for using mupdog.
mupoutThe output from fitting
mupdog to uitdewilligen.
The package is named updog for "Using
Parental Data for Offspring Genotyping" because
we originally developed the
method for full-sib populations, but it works
now for more general populations.
Our best competitor is probably the fitPoly package,
which you can check out at
https://cran.r-project.org/package=fitPoly. Though, we think
that updog returns better calibrated measures of uncertainty
when you have next-generation sequencing data.
If you find a bug or want an enhancement, please submit an issue at http://github.com/dcgerard/updog/issues.
Gerard, D., Ferr<U+00E3>o, L. F. V., Garcia, A. A. F., & Stephens, M. (2018). Genotyping Polyploids from Messy Sequencing Data. Genetics, 210(3), 789-807. doi: 10.1534/genetics.118.301468.
Gerard, D. and Ferr<U+00E3>o, L. F. V. (2019). Priors for Genotyping Polyploids. Bioinformatics. doi: 10.1093/bioinformatics/btz852.