Apply hierarchical clustering to selected metrics. Person averages are computed prior to clustering. The hierarchical clustering uses cosine distance and the ward.D method of agglomeration.
personas_hclust(data, metrics, k = 4, return = "plot")
A data frame containing PersonId
and selected metrics for
clustering.
Character vector containing names of metrics to use for clustering. See examples section.
Numeric vector to specify the k
number of clusters to cut by.
String specifying what to return. This must be one of the following strings:
"plot"
"data"
"table"
"hclust"
See Value
for more information.
A different output is returned depending on the value passed to the return
argument:
"plot"
: 'ggplot' object. A heatmap plot comparing the key metric averages
of the clusters as per keymetrics_scan()
.
"data"
: data frame. Raw data with clusters appended
"table"
: data frame. Summary table for identified clusters
"hclust"
: 'hclust' object. hierarchical model generated by the function.
Other Clustering:
workpatterns_classify()
,
workpatterns_hclust()
# NOT RUN {
# Return plot
personas_hclust(sq_data,
metrics = c("Collaboration_hours", "Workweek_span"),
k = 4)
# Return summary table
personas_hclust(sq_data,
metrics = c("Collaboration_hours", "Workweek_span"),
k = 4,
return = "table")
# }
# NOT RUN {
# Return data with clusters appended
personas_hclust(sq_data,
metrics = c("Collaboration_hours", "Workweek_span"),
k = 4,
return = "data")
# }
# NOT RUN {
# }
Run the code above in your browser using DataLab