An implementation of the AMFA algorithm from WangWan-Lun2020AlomautoMFA. The number of factors, q, is estimated during the fitting process of each MFA model.
The best value of g is the one whose fitted model has the minimum BIC among all candidate models in the range gmin <= g <= gmax.
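The selection rule described above can be sketched in base R. This is an illustration only, not the package's internal code: `select_g` is a hypothetical helper and the BIC values in `bic_by_g` are made up for the example.

```r
# Hypothetical helper: pick the number of components with the smallest BIC.
# `bic` holds one BIC value per candidate g, ordered gmin, gmin + 1, ..., gmax.
select_g <- function(bic, gmin) {
  stopifnot(is.numeric(bic), length(bic) >= 1)
  # which.min() returns the position of the first minimum
  gmin + which.min(bic) - 1
}

# Made-up BIC values for g = 1, ..., 5 (illustration only)
bic_by_g <- c(5210.4, 4987.1, 4902.6, 4915.3, 4950.8)
best_g <- select_g(bic_by_g, gmin = 1)
```

Here the third candidate has the smallest BIC, so `best_g` corresponds to g = 3. Offsetting by `gmin` matters whenever the search does not start at g = 1.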
AMFA(
Y,
gmin = 1,
gmax = 10,
eta = 0.005,
itmax = 500,
nkmeans = 5,
nrandom = 5,
tol = 1e-05,
conv_measure = "diff",
varimax = FALSE
)
Y: An n by p data matrix, where n is the number of observations and p is the number of dimensions of the data.
gmin: The smallest number of components for which an MFA model will be fitted.
gmax: The largest number of components for which an MFA model will be fitted.
eta: The smallest possible entry in any of the error matrices D_i (Jian-HuaZhao2008FMEfautoMFA).
itmax: The maximum number of ECM iterations allowed for the estimation of each MFA model.
nkmeans: The number of times the k-means algorithm will be used to initialise models for each combination of g and q.
nrandom: The number of randomly initialised models that will be used for each combination of g and q.
tol: The ECM algorithm terminates if the measure of convergence falls below this value.
conv_measure: The convergence criterion of the ECM algorithm. The default 'diff' stops the ECM iterations if |l^(k+1) - l^(k)| < tol, where l^(k) is the log-likelihood at the kth ECM iteration. If 'ratio', the ECM iterations stop when |(l^(k+1) - l^(k))/l^(k+1)| < tol.
varimax: Logical indicating whether the output factor loading matrices should be constrained using varimax rotation.
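The two convergence measures documented for conv_measure can be written out as a small base R function. This is a minimal sketch under the definitions given above; `has_converged` is a hypothetical helper, not a function exported by the package.

```r
# Hypothetical helper mirroring the two documented convergence measures.
# l_old and l_new are the log-likelihoods at ECM iterations k and k + 1.
has_converged <- function(l_old, l_new, tol = 1e-05,
                          conv_measure = c("diff", "ratio")) {
  conv_measure <- match.arg(conv_measure)
  measure <- switch(conv_measure,
    diff  = abs(l_new - l_old),            # absolute change in log-likelihood
    ratio = abs((l_new - l_old) / l_new)   # change relative to the new value
  )
  measure < tol
}

# Absolute change of about 5e-06 is below the default tol of 1e-05
has_converged(-1234.567890, -1234.567885)

# Relative change of about 8e-05 is above the default tol
has_converged(-1234.5, -1234.4, conv_measure = "ratio")
```

The 'ratio' measure is scale-free, so the same tol behaves similarly for small and large data sets, whereas 'diff' becomes stricter in relative terms as the magnitude of the log-likelihood grows.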
A list containing the following elements:
model: A list specifying the final MFA model. This contains:
B: A p by q by g array containing the factor loading matrices for each component.
D: A p by p by g array of error variance matrices.
mu: A p by g array containing the mean of each cluster.
pivec: A 1 by g vector containing the mixing proportions for each FA in the mixture.
numFactors: A 1 by g vector containing the number of factors for each FA.
clustering: A list specifying the clustering produced by the final model. This contains:
responsibilities: An n by g matrix containing the probability that each point belongs to each FA in the mixture.
allocations: An n by 1 matrix indicating which FA in the mixture each point is assigned to, based on the responsibilities.
diagnostics: A list containing various pieces of information related to the fitting process of the algorithm. This contains:
bic: The BIC of the final model.
logL: The log-likelihood of the final model.
times: A data frame containing the amount of time taken to fit each MFA model.
totalTime: The total time taken to fit the final model.
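To illustrate how the responsibilities and allocations elements relate, the sketch below derives hard allocations from a responsibility matrix in base R. It assumes that each point is assigned to the component with the largest responsibility, which is one natural reading of the description above; the package's own tie-breaking may differ, and the matrix `resp` is made up for the example.

```r
# Made-up n = 4 by g = 3 responsibility matrix; each row sums to 1.
resp <- matrix(c(0.90, 0.05, 0.05,
                 0.10, 0.70, 0.20,
                 0.20, 0.30, 0.50,
                 0.60, 0.30, 0.10),
               nrow = 4, byrow = TRUE)

# Assumption: assign each observation to the component with the largest
# responsibility. ties.method = "first" makes tie-breaking deterministic.
alloc <- matrix(max.col(resp, ties.method = "first"), ncol = 1)
```

The result is an n by 1 matrix of component indices, matching the documented shape of allocations.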
WangWan-Lun2020AlomautoMFA
Jian-HuaZhao2008FMEfautoMFA
# NOT RUN {
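RNGversion('4.0.3'); set.seed(3)
# Fit only g = 3 (gmin = gmax = 3), with fewer starts and iterations than the defaults
MFA.fit <- AMFA(autoMFA::MFA_testdata, gmin = 3, gmax = 3, nkmeans = 3, nrandom = 3, itmax = 100)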
# }