Usage
PoisMixClus(y, g, conds, lib.size = TRUE, lib.type = "TMM",
init.type = "small-em", init.runs = 1, init.iter = 10,
alg.type = "EM", cutoff = 10e-6, iter = 1000, fixed.lambda = NA,
equal.proportions = FALSE, prev.labels = NA,
prev.probaPost = NA, verbose = FALSE, interpretation = "sum",
EM.verbose = FALSE)
PoisMixClusWrapper(y, gmin = 1, gmax, conds, lib.size = TRUE,
lib.type = "TMM", gmin.init.type = "small-em",
init.runs = 1, init.iter = 10, split.init = TRUE, alg.type = "EM",
cutoff = 10e-6, iter = 1000, fixed.lambda = NA,
equal.proportions = FALSE, verbose = FALSE, interpretation = "sum",
EM.verbose = FALSE)Arguments
y
(n x q) matrix of observed counts for n observations and q variables
g
Number of clusters (a single value). If fixed.lambda contains a list of lambda values to be fixed,
g corresponds to the number of clusters in addition to those fixed.
gmin
The minimum number of clusters in a sequence to be tested, where gmin corresponds to the
minimum number of clusters in addition to those fixed.
gmax
The maximum number of clusters in a sequence to be tested, where gmax corresponds to the
maximum number of clusters in addition to those fixed.
conds
Vector of length q defining the condition (treatment group) for each variable (column) in y
lib.size
If FALSE, the library size parameter is not included in the model (i.e., the PMM-I model). If TRUE, the library size parameter is included in the Poisson mixture model (i.e., the PMM-II model)
lib.type
If lib.size = TRUE, the estimator to be used for the library size parameter: (TC for total count, UQ for upper quantile, Med for median,
init.type
Type of initialization strategy to be used (small-em for the Small-EM strategy described in Rau et al. (2011), and kmeans for a simple K-means initialization)
gmin.init.type
Type of initialization strategy to be used for the minimum number of clusters in a sequence (gmin):
(small-em for the Small-EM strategy described in Rau et al. (2011), and kmeans f
init.runs
Number of iterations to be used for the Small-EM strategy described in Rau et al. (2011), with a default value of 1
init.iter
Number of iterations to be used within each run for the Small-EM strategry, with a default value of 10
split.init
If TRUE, the splitting initialization strategy of Papastamoulis et al. (2012) will be used for cluster sizes
(gmin+1, ..., gmax). If FALSE, the initialization strategy specified in gmin.init.type<
alg.type
Algorithm to be used for parameter estimation (EM or CEM)
cutoff
Cutoff to declare algorithm convergence (in terms of differences in log likelihoods from one iteration to the next)
iter
Maximum number of iterations to be run for the chosen algorithm
fixed.lambda
If one (or more) clusters with fixed values of lambda is desired, a list containing vectors of length d (the number of conditions). Note that the values of lambda chosen must satisfy the constraint noted in the technical report.
equal.proportions
If TRUE, the cluster proportions are set to be equal for all clusters. Default is FALSE (unequal cluster proportions).
prev.labels
A vector of length n of cluster labels obtained from the previous run (g-1 clusters) to be used with the splitting
small-EM strategy described in described in Papastamoulis et al. (2012). For other initialization strategies, this
parameter tak
prev.probaPost
An n x (g-1) matrix of the conditional probabilities of each observation belonging to each of the
g-1 clusters from the previous run, to be used with the splitting small-EM strategy of described in Papastamoulis
et al. (2012
verbose
If TRUE, include verbose output
interpretation
If "sum", cluster behavior is interpreted with respect to overall gene expression level (sums per gene),
otherwise for "mean", cluster behavior is interpreted with respect to mean gene expression (means per gene).
EM.verbose
If TRUE, more informative output is printed about the EM algorithm, including the number of iterations run and
the difference between log-likelihoods at the last and penultimate iterations.