Last chance! 50% off unlimited learning
Sale ends in
Generates a finite mixture model with Gaussian components for prespecified levels of maximum and/or average overlaps.
MixSim(BarOmega = NULL, MaxOmega = NULL, K, p, sph = FALSE, hom = FALSE,
ecc = 0.90, PiLow = 1.0, int = c(0.0, 1.0), resN = 100,
eps = 1e-06, lim = 1e06)
value of desired average overlap.
value of desired maximum overlap.
number of components.
number of dimensions.
covariance matrix structure (FALSE - non-spherical, TRUE - spherical).
heterogeneous or homogeneous clusters (FALSE - heterogeneous, TRUE - homogeneous).
maximum eccentricity.
value of the smallest mixing proportion (if 'PiLow' is not reachable with respect to K, equal proportions are taken; PiLow = 1.0 implies equal proportions by default).
mean vectors are simulated uniformly on a hypercube with sides specified by int = (lower.bound, upper.bound).
maximum number of mixture resimulations.
error bound for overlap computation.
maximum number of integration terms (Davies, 1980).
vector of mixing proportions.
matrix consisting of components' mean vectors (K * p).
set of components' covariance matrices (p * p * K).
matrix of misclassification probabilities (K * K); OmegaMap[i,j] is the probability that X coming from the i-th component is classified to the j-th component.
value of average overlap.
value of maximum overlap.
row and column numbers for the pair of components producing maximum overlap 'MaxOmega'.
flag value; 0 represents successful mixture generation, 1 represents failure.
If 'BarOmega' is not specified, the function generates a mixture solely based on 'MaxOmega'; if 'MaxOmega' is not specified, the function generates a mixture solely based on 'BarOmega'.
Maitra, R. and Melnykov, V. (2010) ``Simulating data to study performance of finite mixture modeling and clustering algorithms'', The Journal of Computational and Graphical Statistics, 2:19, 354-376.
Melnykov, V., Chen, W.-C., and Maitra, R. (2012) ``MixSim: An R Package for Simulating Data to Study Performance of Clustering Algorithms'', Journal of Statistical Software, 51:12, 1-25.
Davies, R. (1980) ``The distribution of a linear combination of chi-square random variables'', Applied Statistics, 29, 323-333.
overlap
, pdplot
, and simdataset
.
# NOT RUN {
<!-- %\dontrun{ -->
# }
# NOT RUN {
set.seed(1234)
# controls average and maximum overlaps
(ex.1 <- MixSim(BarOmega = 0.05, MaxOmega = 0.15, K = 4, p = 5))
summary(ex.1)
# controls average overlap
(ex.2 <- MixSim(BarOmega = 0.05, K = 4, p = 5, hom = TRUE))
summary(ex.2)
# controls maximum overlap
(ex.3 <- MixSim(MaxOmega = 0.15, K = 4, p = 5, sph = TRUE))
summary(ex.3)
# }
# NOT RUN {
<!-- %} -->
# }
Run the code above in your browser using DataLab