Calculate the position weight matrix of overlapping sequences. Simulation of human population is based on single nucleotide variation.
calPWM(
  kmers,
  pseudo.num = 0,
  bg.prop = c(a = 0.295, c = 0.205, g = 0.205, t = 0.295),
  output = "PWM"
)
Returns a position count, probability, or weight matrix (PCM, PPM, or PWM), as selected by output.
kmers: A vector of k-mers to overlap.
pseudo.num: Pseudo-count added to avoid numerical instability when a base is absent at a position. Default is zero, i.e. no pseudo-count.
bg.prop: Background proportion of each base. Default is c(a = 0.295, c = 0.205, g = 0.205, t = 0.295), as observed in the human genome.
output: Output matrix type. Options are "PCM", "PPM", and "PWM", which refer to the position count, probability, and weight matrix respectively. Default is "PWM".
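
The relationship between the three output types can be illustrated with a short base-R sketch. This is not the implementation of calPWM; sketch_pwm is a hypothetical helper, and it assumes that the k-mers have equal length, that the pseudo-count is added to every cell of the count matrix before normalising, and that the weight matrix is the log2 ratio of the position probability to the background proportion.

```r
# Hypothetical helper, NOT the package's calPWM(): a sketch of the usual
# count -> probability -> weight progression under the assumptions above.
sketch_pwm <- function(kmers, pseudo.num = 0,
                       bg.prop = c(a = 0.295, c = 0.205, g = 0.205, t = 0.295),
                       output = "PWM") {
  bases <- c("a", "c", "g", "t")
  # One row per k-mer, one column per position (k-mers assumed equal length).
  chars <- do.call(rbind, strsplit(tolower(kmers), ""))

  # Position count matrix: rows are bases, columns are positions.
  pcm <- matrix(0, nrow = 4, ncol = ncol(chars), dimnames = list(bases, NULL))
  for (b in bases) pcm[b, ] <- colSums(chars == b)
  if (output == "PCM") return(pcm)

  # Position probability matrix: add the pseudo-count (assumed to apply to
  # every cell), then normalise each column to sum to one.
  smoothed <- pcm + pseudo.num
  ppm <- sweep(smoothed, 2, colSums(smoothed), "/")
  if (output == "PPM") return(ppm)

  # Position weight matrix: log2 odds against the background (assumed form).
  # The length-4 vector recycles down each column, so every row is divided
  # by the background proportion of its own base.
  log2(ppm / bg.prop[bases])
}

kmers <- c("ACGT", "ACGA", "ACTT", "AGGT")
sketch_pwm(kmers, output = "PCM")   # raw base counts per position
sketch_pwm(kmers, output = "PPM")   # each column sums to 1
sketch_pwm(kmers)                   # contains -Inf where a base never occurs
sketch_pwm(kmers, pseudo.num = 1)   # finite everywhere
```

With pseudo.num = 0, any base that never occurs at a position has probability zero and therefore a weight of -Inf, which is the numerical instability the pseudo-count is meant to avoid.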