When running a QC over multiple files, QC_series collects the
values of the skewness_HQ and kurtosis_HQ output of QC_GWAS in a
table, which is then passed to this function to convert it into
a plot. Note that these values are calculated over high-quality
SNPs only.

Kurtosis is a measure of how well a distribution matches a
Gaussian distribution. A Gaussian distribution has a kurtosis
of 0. Negative kurtosis indicates a flatter distribution curve,
while positive kurtosis indicates a sharper, thinner curve.

Skewness is a measure of distribution asymmetry. A symmetrical
distribution has skewness 0. A positive skewness indicates a
long tail towards higher values, while a negative skewness
indicates a long tail towards lower values.
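
The two statistics described above can be illustrated with a short
base-R sketch. The helper names moment_skewness and excess_kurtosis
are hypothetical and are not part of the package; they use the plain
moment-based definitions, which may differ slightly from the
estimator QC_GWAS applies internally. A kurtosis of 0 for a Gaussian
distribution corresponds to "excess" kurtosis, i.e. the standardized
fourth moment minus 3.

```r
# Hypothetical helpers (not package functions): moment-based estimators.
moment_skewness <- function(x) {
  x <- x[!is.na(x)]            # ignore missing effect sizes
  d <- x - mean(x)             # deviations from the mean
  mean(d^3) / mean(d^2)^1.5    # standardized third moment
}

excess_kurtosis <- function(x) {
  x <- x[!is.na(x)]
  d <- x - mean(x)
  mean(d^4) / mean(d^2)^2 - 3  # standardized fourth moment, minus 3
}

set.seed(1)
gaussian     <- rnorm(1e5)  # symmetric, Gaussian tails
right_tailed <- rexp(1e5)   # long tail towards higher values
flat         <- runif(1e5)  # flatter than a Gaussian curve

moment_skewness(gaussian)      # close to 0
excess_kurtosis(gaussian)      # close to 0
moment_skewness(right_tailed)  # clearly positive
excess_kurtosis(flat)          # negative
```

Simulated data are used here only to show the sign conventions; real
effect-size distributions will generally be less clean.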

Ideally, one expects both the skewness and kurtosis of effect
sizes to be close to 0. In practice, these statistics can vary
widely. QC_series uses only high-quality effect sizes to
calculate these values in order to reduce some of the more
extreme values. Still, it is recommended that you compare the
values to those of other GWAS with the same phenotype, rather
than relying on the label outliers command to identify problems.