The AIC is a measure (by Kullback-Leibler directed distance, up to an additive constant) of quality of prediction of new data by a fitted model.
Comparing information criteria may be viewed as a fast alternative to a comparison of the predictive accuracy of different models by cross-validation. Further procedures for model choice may also be useful (e.g. Williams, 1970; Lewis et al. 2010).
The conditional AIC (Vaida and Blanchard 2005) applies the AIC concept to new realizations of a mixed model, conditional on the realized values of the random effects. Lee et al. (2006) and Ha et al (2007) defined a corrected AIC [i.e., AIC(D*) in their eq. 7] which is here interpreted as the conditional AIC.
Such Kullback-Leibler relative distances cannot generally be evaluated exactly and various estimates have been discussed.
get_any_IC
computes, optionally prints, and returns invisibly one or more of the following quantities: (1) Akaike's classical AIC (marginal AIC, mAIC
); (2) a plug-in estimate (cAIC
) and/or a bootstrap estimate (b_cAIC
) of the conditional AIC; and (3) a focussed AIC for dispersion parameters (dispersion AIC, dAIC
).
For the conditional AIC, Vaida and Blanchard's plug-in estimator involves the conditional likelihood, and degrees of freedom for (i) estimated residual error parameters and (ii) the overall linear predictor characterized by the Effective degrees of freedom already discussed by previous authors including Lee and Nelder (1996), which gave a plug-in estimator (\(p_D\)) for it in HGLMs.
By default, the plug-in estimate of both the conditional AIC and of \(n-p_D\) (GoFdf
, where \(n\) is the length of the response vector) are returned by get_any_IC
. But these are biased estimates of conditional AIC and effective df, and an alternative procedure is available if a non-default positive nsim
value is used. In that case, the conditional AIC is estimated by a bootstrap version of Saefken et al. (2014)'s equation 2.5; this involves refitting the model to each bootstrap samples, so it may take time, and a full cross-validation procedure might as well be considered for model selection.
The dispersion AIC has been defined from restricted likelihood by Ha et al (2007; eq.10). The present implementation will use restricted likelihood only if made available by an REML fit, otherwise marginal likelihood is used.