Create profiles of observed variables using two-step cluster analysis
create_profiles(df, ..., n_profiles, to_center = FALSE, to_scale = FALSE,
distance_metric = "squared_euclidean", linkage = "complete",
plot_centered_data = FALSE, plot_raw_data = FALSE)
with two or more columns with continuous variables
unquoted variable names separated by commas
The specified number of profiles to be found for the clustering solution
Boolean (TRUE or FALSE) for whether to center the raw data with M = 0
Boolean (TRUE or FALSE) for whether to scale the raw data with SD = 1
Distance metric to use for hierarchical clustering; "squared_euclidean" is default but more options are available (see ?hclust)
Linkage method to use for hierarchical clustering; "complete" is default but more options are available (see ?dist)
Boolean (TRUE or FALSE) for whether to center the data before plotting (should not be used if to_center = T; only if to_center = F, in cases in which raw data is used to create profiles but centered profiles are desired for visualization purposes)
Boolean (TRUE or FALSE) for whether to plot the raw data, regardless of whether the data are centered or scaled before clustering.
A list containing the prepared data, the output from the hierarchical and k-means cluster analysis, the r-squared value, raw clustered data, processed clustered data of cluster centroids, and a ggplot object.
Function to create a specified number of profiles of observed variables using a two-step (hierarchical and k-means) cluster analysis.
# NOT RUN {
create_profiles(mtcars, disp, hp, wt, n_profiles = 2, to_scale = TRUE)
# }
Run the code above in your browser using DataLab