Dirichlet Process Clustering with Dissimilarities (DPCD) models dissimilarity
data using an infinite mixture model with a Dirichlet Process prior. The six
available covariance structures for mixture components are:
"UU": Unequal Unrestricted — each component has its own unrestricted covariance matrix.
"EU": Equal Unrestricted — components share a common unrestricted covariance matrix.
"UD": Unequal Diagonal — each component has its own diagonal covariance matrix.
"ED": Equal Diagonal — components share a common diagonal covariance matrix.
"US": Unequal Spherical — each component has its own spherical covariance matrix.
"ES": Equal Spherical — components share a common spherical covariance matrix.
The hyper_params list allows users to specify custom hyperparameter values.
Some hyperparameters are common across all models, while others depend on the
selected covariance structure.
Common hyperparameters:
alpha_0: Concentration parameter for the Dirichlet Process prior.
a_0, b_0: Shape and scale parameters for the Inverse-Gamma prior on the
measurement error parameter.
lambda: Scaling parameter for the prior on component means.
mu_0: Mean vector for the prior on component means.
Model-specific hyperparameters:
nu_0 and Psi_0(degrees of freedom and scale matrix for the Inverse-Wishart prior) - UU and EU only.
alpha_tau and beta_tau (shape and scale
parameters for the Inverse-Gamma prior) - UD, ED, US, and ES only.
The init_params list allows users to supply initial values for model
parameters to assist MCMC convergence. The following parameters may be
initialized:
x: n × p matrix of latent positions.
sigma_sq: Scalar measurement error variance.
mu: trunc_value × p matrix of component means.
Sigma: p × p covariance matrix.
tau_sq: Scalar variance parameter (for "US" and "ES" only).
tau_vec: Length-p variance vector (for "UD" and "ED" only).
beta: Length trunc_value-1 vector of stick-breaking weights.
z: Length-n vector of cluster assignments.
Default values are used for both hyper_params and init_params if none are
supplied.
The output_params vector specifies which model parameters should be saved in
the MCMC output. Valid names include "beta", "pi", "z", "mu",
"Sigma", "sigma_sq", "x", and "delta".