A numeric matrix of data. Each row corresponds to a distinct
observation; each column corresponds to a distinct variable/dimension. It
must not contain NA values.
k
An integer indicating the number of cluster centers to initialize.
method
A character string indicating the initialization method to use.
It can take the following values:
"kmeans++":
the centers are selected using the k-means++ algorithm.
"random":
the centers are randomly selected among the values in
x
References
Arthur, D., & Vassilvitskii, S. (2007). k-means++: the advantages
of careful seeding. Proceedings of the Eighteenth Annual ACM-SIAM Symposium
on Discrete Algorithms, 1027–1035.