mergenormals(xdata, mclustsummary=NULL,
clustering, probs, muarray, Sigmaarray, z,
method=NULL, cutoff=NULL, by=0.005,
numberstop=NULL, renumber=TRUE, M=50, ...) ## S3 method for class 'mergenorm':
summary(object, ...)
## S3 method for class 'summary.mergenorm':
print(x, ...)
summary.mclustBIC
for xdata
. Either
mclustsummary
or all of clustering
,
probs
, muarray
,
"bhat"
, "ridge.uni"
,
"ridge.ratio"
, "demp"
, "dipuni"
,
"diptantrum"
, "predictive"
. See details."ridge.uni"
and
"ridge.ratio"
. Methods "dipuni"
and
"diptantrum"
require ridgeline computaticutoff
is ignored and
components are merged until the number of clusters specified here is
reached.TRUE
merged clusters are renumbered
from 1 to their number. If not, numbers of the original clustering
are used (numbers of components that were merged into others then
will not appear).method="predictive"
.ridgeline.diagnosis
or mixpredictive
(in
mergenormals
).mergenorm
, output of
mergenormals
.summary.mergenorm
, output of
summary.mergenorm
.mergenormals
gives out an object of class mergenorm
,
which is a List with componentsrenumber=TRUE
, in which case they may be needed to understand
the numbering of some further components, see below.mclustsummary
was provided. Entry
no. i refers to number i in clusternumbers
. The list entry i
contains the parameters of the original mixture components that
make up cluster i, as extracted by
extract.mixturepars
.method=="predictive"
. See
mixpredictive
.orig.decisionmatrix
,
but with final entries; numbering of rows and columns corresponds to
clusternumbers
; all entries corresponding to other rows and
columns can be ignored.probs
(see arguments)
for merged components, generated by (potentially repeated) execution
of mergeparameters
out of the original
ones. Numbered according to clusternumbers
.probs
.probs
.probs
.TRUE
, there was a noise component
fitted in the initial mclust clustering (see help for
initialization
in mclustBIC
). In this
case, a cluster number 0 indicates noise. noise is ignored by the
merging methods and kept as it was originally.summary.mergenorm
gives out a list with components
clustering, clusternumbers, defunct.components, valuemerged,
mergedtonumbers, predvalues, probs, muarray, Sigmaarray, z, noise,
method, cutoff
as above, plus onc
(original number of
components) and mnc
(number of clusters after merging).method="predictive"
) are merged. Then criterion values are
recomputed for the merged cluster. Merging is continued until the
criterion value to merge is below (or above, for
method="predictive"
) the cutoff value. Details are given in
Hennig (2010). The following criteria are offered, specified by the
method
-argument.
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]Hennig, C. (2010) Methods for merging Gaussian mixture components, Advances in Data Analysis and Classification, 4, 3-34. Ray, S. and Lindsay, B. G. (2005) The Topography of Multivariate Normal Mixtures, Annals of Statistics, 33, 2042-2065.
Tantrum, J., Murua, A. and Stuetzle, W. (2003) Assessment and Pruning of Hierarchical Model Based Clustering, Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining, Washington, D.C., 197-205.
Tibshirani, R. and Walther, G. (2005) Cluster Validation by Prediction Strength, Journal of Computational and Graphical Statistics, 14, 511-528.
data(crabs)
dc <- crabs[,4:8]
cm <- mclustBIC(crabs[,4:8],G=9,modelNames="EEE")
scm <- summary(cm,crabs[,4:8])
cmnbhat <- mergenormals(crabs[,4:8],scm,method="bhat")
summary(cmnbhat)
cmndemp <- mergenormals(crabs[,4:8],scm,method="demp")
summary(cmndemp)
# Other methods take a bit longer, but try them!
# The values of by and M below are still chosen for reasonably fast execution.
# cmnrr <- mergenormals(crabs[,4:8],scm,method="ridge.ratio",by=0.05)
# cmd <- mergenormals(crabs[,4:8],scm,method="dip.tantrum",by=0.05)
# cmp <- mergenormals(crabs[,4:8],scm,method="predictive",M=3)
Run the code above in your browser using DataLab