generate.basic(N.allspmds, N.spmd, N.K.spmd, N, p, K)N.K.spmd is N.spmd,
a vector of length
$K$.N.spmd is N.K number of clusters, as the input
p dimension of data
X.spmd,
as the input
N total sample size, as the input
N.allspmds a collection of sample sizes for all
$S$ processors, as the input
N.spmd total sample size of given processor,
as the input
N.K.spmd sample size of each clusters given
processor, as the input
X.spmd generated data set with dimension with
dimension N.spmd * p
CLASS.spmd
true id of each data, a vector of
length N.spmd
and has values from 1 to K
N.CLASS.spmd true sample size of each clusters, a
vector of length K
}The clusters centers and dispersions are generated automatically inside the code. Currently, it is not allowed for users to change, but it is not difficult to specify them by mimicking this code.
Programming with Big Data in R Website:
generate.MixSim.# Examples can be found in the help pages of em.step(),
# aecm.step(), apecm.step(), and apecma.step().Run the code above in your browser using DataLab