tMixture
in OCplus
package.discTMix(tstat, n1 = 10, n2 = n1, nq, p0, p1, D, delta, paired = FALSE,
tbreak, ext = TRUE, threshold.delta=0.75, ...)
D
.p1
.D
, though if both are given, delta
will get priority.tstat
, or the explicit break points for the bins, very much like the argument breaks
to function cut
; the default value is the square root of the number of genedelta
below this value are considered to be too small for independent estimation; these components and their corresponding p1
are pooled with the null-componeoptim
to control the optimization.class
discTMix
, with the following components:threshold.delta
.p0.raw
.optim
, giving details about the optimization process.nq
- if nothing else is given, the proportions are equally distributed between p0
and the p1
, and the noncentrality parameters are set up symmetrically around zero, e.g. nq=5
leads to equal proportions of 0.2 and noncentrality parameters -2, -1, 1, and 2. If any of p1
, D
, or delta
is specified, nq
is redundant and will be ignored (with a warning). discTMix
will in general make a valiant effort to deduce valid starting values from any combination of nq
, p0
, p1
, D
, and delta
specified by the user, and will complain if that is not possible. The fitting problem that this function tries to solve is badly conditioned, and will in general depend on the precise set of starting values. Multiple runs from different starting values are usually a good idea. We have found however, that the model seems fairly robust towards misspecification of the number of components, at least when estimating p0
. What happens when too many components are specified is that some of the nominally noncentral t-distributions describing the behaviour of differentially expressed genes are fitted with noncentrality parameters very close to zero, and the true p0
gets spread out between the nominal p0
and the almost-central components. Adding up these different contributions usually gives a similar solution to re-fitting the model with fewer components. The cutoff for the size of non-centrality parameters that can be estimated realistically is specified via threshold.delta
, whose default value is based on a small simulation study reported in Pawitan et al. (2005); see Examples. (Note that the AIC can also be helpful in determining the number of components.)
tstatistics
, EOC
, optim
, fitted.discTMix