Calculate string similarity using the Levenshtein distance and return
clusters of similar strings.
Usage
ClusterStrings(x, maxCluster = 12)
Value
NameClusters() returns an integer assigning each element of x
to a cluster, with an attribute med specifying the median string in each
cluster, and silhouette reporting the silhouette coefficient of the optimal
clustering. Coefficients < 0.5 indicate weak structure, and no clusters are
returned. If the number of unique elements of x is less than maxCluster,
all occurrences of each entry are assigned to an individual cluster.
Arguments
x
Character vector.
maxCluster
Integer specifying maximum number of clusters to consider.