
Correctly creates a cluster membership variable that can be attached to a dataframe when only a subset of the observations in that dataframe was used to create the clustering solution. Observations of the original dataframe that were not used in creating the clustering solution are assigned NA.
assignCluster(clusterData, origData, clusterVec)
A factor (with integer labels) that indicates the cluster assignment for each observation, with an NA value given to observations not used in the clustering solution.
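The behaviour described above can be illustrated with a minimal base R sketch. The helper `assign_cluster_sketch` and the toy data are hypothetical and are not the package's own source; the sketch assumes that row names are unique and that every clustered row also appears in the original dataframe.

```r
# Hypothetical helper illustrating the idea; not the package's own source.
assign_cluster_sketch <- function(clusterData, origData, clusterVec) {
  # Start with NA for every row of the original data frame
  out <- rep(NA_integer_, nrow(origData))
  # Locate each clustered row in the original data frame by row name
  pos <- match(rownames(clusterData), rownames(origData))
  out[pos] <- clusterVec
  factor(out)
}

# Toy data: five observations, only three of which were clustered
orig <- data.frame(x = c(1, 2, 10, 11, 20),
                   row.names = c("a", "b", "c", "d", "e"))
sub <- orig[c("a", "c", "e"), , drop = FALSE]
membership <- c(1L, 2L, 3L)  # made-up cluster labels for rows a, c, e

orig$cluster <- assign_cluster_sketch(sub, orig, membership)
print(orig)
```

Rows "b" and "d" were not part of the clustered subset, so they receive NA in the new column.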
clusterData: The data matrix used in the clustering solution. The data matrix may have only a subset of the observations contained in the original dataframe.
origData: The original dataframe from which the data used in the clustering solution were taken.
clusterVec: An integer variable containing the cluster membership assignments for the observations used in creating the clustering solution. This vector can be created using cutree for clustering solutions generated by hclust, or taken from the cluster component of a list object created by kmeans or KMeans.
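As a rough illustration of the two routes for obtaining the cluster membership vector, the sketch below uses only base R functions (hclust, cutree, kmeans) on a subset of USArrests. The object names (lowUrban, clusterVecH, clusterVecK) and the choice of three clusters are assumptions made for the example.

```r
data(USArrests)
# Subset used for clustering: states with UrbanPop below 66
lowUrban <- USArrests[USArrests$UrbanPop < 66, ]

# Hierarchical route: hclust() builds the tree, cutree() cuts it into 3 groups
hc <- hclust(dist(lowUrban))
clusterVecH <- cutree(hc, k = 3)

# k-means route: take the cluster component of the returned list
set.seed(1)  # k-means starts from randomly chosen centers
km <- kmeans(lowUrban, centers = 3)
clusterVecK <- km$cluster

# Each vector holds one integer label per clustered observation
length(clusterVecH) == nrow(lowUrban)
length(clusterVecK) == nrow(lowUrban)
```

Either vector has one entry per row of the clustered subset, which is the form expected for the cluster membership argument.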
Dan Putler
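# Assumes KMeans() and assignCluster() are available, e.g. via library(RcmdrMisc)
data(USArrests)
# Cluster only the states with UrbanPop below 66
USArrkm3 <- KMeans(USArrests[USArrests$UrbanPop<66, ], centers=3)
# Map the cluster labels back onto all rows of USArrests; unused rows get NA
assignCluster(USArrests[USArrests$UrbanPop<66, ], USArrests, USArrkm3$cluster)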