This function is a HIPAM algorithm adapted to deal with anthropometric data. To that end, a different dissimilarity function is incorporated. This function is that explained in McCulloch et al. (1998) and it is implemented in GetDistMatrix
. We call it $d_{MO}$. In addition, a different method to obtain a classification tree is also incorporated.
Two HIPAM algorithms are proposed. The first one, called $HIPAM_{MO}$, is a HIPAM that uses $d_{MO}$. The second one, $HIPAM_{IMO}$, is a HIPAM algorithm that uses $d_{MO}$ and the INCA (Index Number Clusters Atypical) statistic criterion (Irigoien et al. (2008)) to decide the number of child clusters and as a stopping rule.
See Vinue et al. (2013) for more details.
hipamAnthropom(x,asw.tol=0,maxsplit=5,local.const=NULL,orness=0.7,type,
ahVect=c(23,28,20,25,25),...)
WeightsMixtureUB
and GetDistMatrix
.ah
slopes of the distance function in GetDistMatrix
. Given the five variables considered, this vector is c(23,28,20,25,25). This vector would be different accordclustering: Final clustering that corresponds to the last level of the tree.
asw: The asw of the final clustering.
n.levels: Number of levels in the tree.
medoids: Medoids of all of the clusters in the tree.
active: Activity status of each cluster (FALSE for every cluster of the final partition).
development: Matrix that indicates the ancestors of the final clusters.
num.of.clusters: Number of clusters in the final clustering.
metric: Dissimilarity used (called 'McCulloch' because the dissimilarity function used is that explained in McCulloch et al. (1998)).
getBestPamsamMO
and checkBranchLocalMO
functions, while the $HIPAM_{IMO}$ algorithm uses the getBestPamsamIMO
and checkBranchLocalIMO
functions. For more details of HIPAM, see van der Laan et al. (2003), Wit et al. (2004) and the manual of the
Wit, E., and McClure, J., (2004). Statistics for Microarrays: Design, Analysis and Inference. John Wiley & Sons, Ltd.
Wit, E., and McClure, J., (2006). Statistics for Microarrays: Inference, Design and Analysis. R package version 0.1.
van der Laan, M. J., and Pollard, K. S., (2003). A new algorithm for hybrid hierarchical clustering with visualization and the bootstrap, Journal of Statistical Planning and Inference 117, 275--303.
Pollard, K. S., and van der Laan, M. J., (2002). A method to identify significant clusters in gene expression data. Vol. II of SCI2002 Proceedings, 318--325.
Irigoien, I., and Arenas, C., (2008). INCA: New statistic for estimating the number of clusters and identifying atypical units, Statistics in Medicine 27, 2948--2973.
Irigoien, I., Sierra, B., and Arenas, C., (2012). ICGE: an R package for detecting relevant clusters and atypical units in gene expression, BMC Bioinformatics 13, 1--29.
McCulloch, C., Paal, B., and Ashdown, S., (1998). An optimization approach to apparel sizing, Journal of the Operational Research Society 49, 492--499.
European Committee for Standardization. Size designation of clothes. Part 3: Measurements and intervals. (2005).
Alemany, S., Gonzalez, J. C., Nacher, B., Soriano, C., Arnaiz, C., and Heras, H., (2010). Anthropometric survey of the Spanish female population aimed at the apparel industry. Proceedings of the 2010 Intl. Conference on 3D Body scanning Technologies, 307--315.
getBestPamsamMO
, getBestPamsamIMO
, checkBranchLocalMO
, checkBranchLocalIMO
, plotTreeHipam
, hipamBigGroups
, outlierHipam
dataDef <- dataDemo
bust <- dataDef$bust
bustCirc_4 <- seq(74,102,4) ; bustCirc_6 <- seq(107,131,6) ; bustCirc <- c(bustCirc_4,bustCirc_6)
nsizes <- length(bustCirc)
maxsplit <- 5 ; orness <- 0.7 ; type <- "IMO" #type <- "MO" for $HIPAM_{MO}$
ahVect <- c(23, 28, 20, 25, 25)
hip <- list()
for(i in 1 : (nsizes - 1)){
data = dataDef[(bust >= bustCirc[i]) & (bust < bustCirc[i + 1]), ]
d <- as.matrix(data)
hip[[i]] <- hipamAnthropom(d,maxsplit=maxsplit,orness=orness,type=type,ahVect=ahVect)
}
str(hip)
ress <- list()
for(i in 1 : length(hip)){
ress[[i]] <- table(hip[[i]]$clustering)
}
ress #clustering results in each bust size.
Run the code above in your browser using DataLab