The denoiSeq package is based on a bottom-up model for PCR sequencing
developed by Ndifon et al. (2012). The model generates, in a bottom-up
manner, a probability distribution for the final copy number of a gene, that
is a superposition of the negative binomial and the binomial distributions.
The derived distribution has three main parameters, i.e N, p and
f, which represent the initial gene amount before amplification,
the amplification efficiency and the dilution rate, respectively.
Bayesian inference is used to estimate the model parameters. The counts in
each column are used to estimate the size factors (Anders and Huber, 2010)
which are in turn used to normalise the counts. For an \(m\) by \(n\)
matrix, inference aims at estimating the three sets of parameters, i.e
\(p, f\) and \(N_i\) <U+2019>s (2m in total because we are considering 2
conditions with the same m genes in each). denoiseq uses the rows in
each condition to estimate parameter \(N_i\) for each gene in that
condition, and uses the entire dataset, combined from both conditions,
to estimate \(p\) and \(f\).
For differential expression analysis, the primary parameters of interest are
\(N_{iA}\) and \(N_{iB}\) (from conditions A and B respectively), for
each gene \(i\).