vcf.stat: Get sequencing quality statistics of raw VCF files
(with GatK generated vcf files only)
Description
This function will generate a table similar to VariantsToTable option in
GatK from raw vcf files for filtering purposes. The function will also
plot all the parameters (see details & values).
Usage
vcf.stat(vcf, plot = TRUE, ...)
Value
Returns a data frame with quality parameters from the INFO. field of
the vcf
QUAL: The Phred-scaled probability that a REF/ALT polymorphism exists
at this site given sequencing data
AC: Allele count
AF: Allele frequency
DP: unfiltered depth
QD: QualByDepth - This is the variant confidence (from the QUAL
field) divided by the unfiltered depth of non-hom-ref samples
FS: FisherStrand - This is the Phred scaled probability that there is
strand bias at the site
SOR: StrandOddsRatio - This is another way to estimate strand bias
using a test similar to the symmetric odds ratio test
MQ: RMSMappingQuality - This is the root mean square mapping quality
over all the reads at the site
MQRankSum: MappingQualityRankSumTest - This is the u-based
z-approximation from the Rank Sum Test for mapping qualities
ReadPosRankSum: ReadPosRankSumTest: This is the u-based
z-approximation from the Rank Sum Test for site position within reads
Arguments
vcf
an imported vcf file in data.frame or matrix format using
readVCF
plot
logical. Whether to plot the (12) parameters
...
other arguments passed on to plot
(e.g. col,border)