KmeansPP

data: Data file name on disk (NUMA optimized) or in-memory data matrix

centers: The number of cluster centers (k)

nrow: The number of samples in the dataset

ncol: The number of features in the dataset

nstart: The number of iterations of kmeans++ to run

nthread: The number of parallel threads to run

dist.type: The dissimilarity metric to use, one of c("taxi", "eucl", "cos")
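The arguments listed here can be combined into a call along the following lines. This is a minimal sketch, not the package's official example: it assumes the 'clusternor' package is installed, that `centers` takes the number of clusters, and that `nrow` and `ncol` may be omitted when `data` is an in-memory matrix. The structure of the returned object is not assumed, so it is only inspected with `str()`.

```r
# Build an in-memory numeric matrix from the built-in iris data set.
iris.mat <- as.matrix(iris[, 1:4])

# Use the number of species as the number of clusters (k = 3).
k <- length(unique(iris$Species))

# Only call KmeansPP when 'clusternor' is available (assumption: it is
# installed from CRAN); otherwise the sketch just prepares the inputs.
if (requireNamespace("clusternor", quietly = TRUE)) {
  km <- clusternor::KmeansPP(iris.mat, centers = k, nstart = 3)
  str(km)  # inspect the result without assuming its field names
}
```

Passing the arguments by name keeps the call readable and avoids relying on their positional order.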

A parallel and scalable implementation of the algorithm described in
Ostrovsky, Rafail, et al. "The effectiveness of Lloyd-type methods for
the k-means problem." Journal of the ACM (JACM) 59.6 (2012): 28.
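To give a feel for what k-means++ style seeding does, the sketch below implements the widely used distance-weighted rule in base R: the first center is drawn uniformly, and each later center is drawn with probability proportional to its squared Euclidean distance from the nearest center chosen so far. It is an illustration only. The helper `seed_kmeanspp` is hypothetical, it is serial rather than parallel, and it may differ in detail from both the cited paper's variant and the package's own implementation.

```r
# Hypothetical helper (not part of 'clusternor'): distance-weighted seeding
# on an in-memory numeric matrix x, returning k rows of x as initial centers.
seed_kmeanspp <- function(x, k) {
  n <- nrow(x)
  idx <- integer(k)
  idx[1] <- sample.int(n, 1)  # first center: uniform at random

  # Squared Euclidean distance from every row to its nearest chosen center.
  d2 <- rowSums(sweep(x, 2, x[idx[1], ])^2)

  for (j in seq_len(k)[-1]) {
    # Rows far from all current centers are more likely to be picked.
    # Note: this fails if every row coincides with a chosen center.
    idx[j] <- sample.int(n, 1, prob = d2)
    d2 <- pmin(d2, rowSums(sweep(x, 2, x[idx[j], ])^2))
  }
  x[idx, , drop = FALSE]
}

set.seed(1)
x <- as.matrix(iris[, 1:4])
centers <- seed_kmeanspp(x, 3)
dim(centers)
```

Because an already chosen row has distance zero, it cannot be drawn again, so the returned centers are distinct rows of the input.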

The clustering 'NUMA' Optimized Routines package, or 'clusternor', is a highly optimized package for performing clustering in parallel, with accelerations specifically targeting multi-core Non-Uniform Memory Access ('NUMA') hardware architectures. Disa Mhembere, Da Zheng, Carey E. Priebe, Joshua T. Vogelstein, Randal Burns (2019) <arXiv:1902.09527>.

KmeansPP: Perform the k-means++ clustering algorithm on a data matrix.
