The Pearson chi-square can be used to assess the fit of N-mixture
models. Instead of relying on the theoretical distribution of the
chi-square, a parametric bootstrap approach is implemented to obtain
P-values with the 'parboot' function of the unmarked package.
'Nmix.chisq' computes the observed chi-square statistic based on the
observed and expected values from the model. 'Nmix.gof.test' calls
internally 'Nmix.chisq' and 'parboot' to generate simulated data sets
based on the model and compute the chi-square test statistic.It is also possible to obtain an estimate of the overdispersion
parameter (c-hat) for the model at hand by dividing the observed
chi-square statistic by the mean of the statistics obtained from
simulation (MacKenzie and Bailey 2004, McKenny et al. 2006). This method
of estimating c-hat is similar to the one implemented for
capture-mark-recapture models in program MARK (White and Burnham 1999).
Note that values of c-hat > 1 indicate overdispersion (variance >
mean). Values much higher than 1 (i.e., > 4) probably indicate
lack-of-fit. In cases of moderate overdispersion, one usually
multiplies the variance-covariance matrix of the estimates by c-hat. As
a result, the SE's of the estimates are inflated (c-hat is also known as
a variance inflation factor).
In model selection, c-hat should be estimated from the global model and
the same value of c-hat applied to the entire model set. Specifically,
a global model is the most complex model from which all the other models
of the set are simpler versions (nested). When no single global model
exists in the set of models considered, such as when sample size does
not allow a complex model, one can estimate c-hat from 'subglobal'
models. Here, 'subglobal' models denote models from which only a subset
of the models of the candidate set can be derived. In such cases, one
can use the smallest value of c-hat for model selection (Burnham and
Anderson 2002).
Note that c-hat counts as an additional parameter estimated and should
be added to K. All functions in package 'AICcmodavg' automatically add 1
when the 'c.hat' argument > 1 and apply the same value of c-hat for the
entire model set. When c-hat > 1, functions compute quasi-likelihood
information criteria (either QAICc or QAIC, depending on the value of
the 'second.ord' argument) by scaling the log-likelihood of the model by
c-hat. The value of c-hat can influence the ranking of the models: as
c-hat increases, QAIC or QAICc will favor models with fewer parameters.
As an additional check against this potential problem, one can generate
several model selection tables by incrementing values of c-hat to assess
the model selection uncertainty. If ranking changes little up to the
c-hat value observed, one can be confident in making inference.
In cases of underdispersion (c-hat < 1), it is recommended to keep the
value of c-hat to 1. However, note that values of c-hat << 1 can also
indicate lack-of-fit and that an alternative model should be investigated.