Compute validity measures for partitions and hierarchies, attempting to measure how well these clusterings capture the underlying structure in the data they were obtained from.
cl_validity(x, ...)
# S3 method for default
cl_validity(x, d, ...)
A list of class "cl_validity"
with the computed validity
measures.
an object representing a partition or hierarchy.
a dissimilarity object from which x
was obtained.
arguments to be passed to or from methods.
cl_validity
is a generic function.
For partitions, its default method gives the “dissimilarity
accounted for”, defined as
For hierarchies, the validity measures computed by default are
“variance accounted for” (VAF, e.g., Hubert, Arabie & Meulman,
2006) and “deviance accounted for” (DEV, e.g., Smith, 2001).
If u
is the ultrametric corresponding to the hierarchy x
and d
the dissimilarity x
was obtained from, these
validity measures are given by
u
, and may be “arbitrarily small” (i.e., 0 using the
above definitions) even though u
and d
are
“structurally close” in some sense.
For the results of using agnes
and
diana
, the agglomerative and divisive
coefficients are provided in addition to the default ones.
L. Hubert, P. Arabie and J. Meulman (2006). The structural representation of proximity matrices with MATLAB. Philadelphia, PA: SIAM.
T. J. Smith (2001).
Constructing ultrametric and additive trees based on the
cluster.stats
in package fpc for a variety of
cluster validation statistics;
fclustIndex
in package e1071 for several
fuzzy cluster indexes;
clustIndex
in package cclust;
silhouette
in package cluster.