This is the significance clustering procedure of Bello et al. (2021).
The method first performs a homogeneity test to verify whether the data can be significantly
partitioned. If the hypothesis of homogeneity is rejected, then the method will search, among all
the significant partitions, for the partition that better separates the data, as measured by larger
bn
statistic. This function should be used in high dimension small sample size settings.
Either data
or md
should be provided.
If data are entered directly, Bn will be computed considering the squared Euclidean distance.
Variance of bn
is estimated through resampling, and thus, p-values may vary a bit in different runs.
For more detail see
Bello, Debora Zava, Marcio Valk and Gabriela Bettella Cybis.
"Clustering inference in multiple groups." arXiv preprint arXiv:2106.09115 (2021).
See also is_homo3
, uclust
.