The Information Matrix Test (IMT), proposed by Suveges and Davison (2010), is based
on the difference between the expected quadratic score and the second derivative of
the log-likelihood. The asymptotic distribution for each threshold u
and gap K
is asymptotically \(\chi^2\) with one degree of freedom. The approximation is good for
\(N>80\) and conservative for smaller sample sizes. The test assumes independence between gaps.
infomat.test(x, q, K, plot = TRUE)
data vector
vector of thresholds
int specifying the largest K-gap
logical: should the graphical diagnostic be plotted?
an invisible list of matrices containing
IMT
a matrix of test statistics
pvals
a matrix of approximate p-values (corresponding to probabilities under a \(\chi^2_1\) distribution)
mle
a matrix of maximum likelihood estimates for each given pair (u
, K
)
loglik
a matrix of log-likelihood values at MLE for each given pair (u
, K
)
threshold
a vector of thresholds based on empirical quantiles at supplied levels.
q
the vector q
supplied by the user
The procedure proposed in Suveges & Davison (2010) was corrected for erratas. The maximum likelihood is based on the limiting mixture distribution of the intervals between exceedances (an exponential with a point mass at zero). The condition \(D^{(K)}(u_n)\) should be checked by the user.
Fukutome et al. (2015) propose an ad hoc automated procedure
Calculate the interexceedance times for each K-gap and each threshold, along with the number of clusters
Select the (u
, K
) pairs for which IMT < 0.05 (corresponding to a P-value of 0.82)
Among those, select the pair (u
, K
) for which the number of clusters is the largest
Fukutome, Liniger and Suveges (2015), Automatic threshold and run parameter selection: a climatology for extreme hourly precipitation in Switzerland. Theoretical and Applied Climatology, 120(3), 403-416.
Suveges and Davison (2010), Model misspecification in peaks over threshold analysis. Annals of Applied Statistics, 4(1), 203-221.
White (1982), Maximum Likelihood Estimation of Misspecified Models. Econometrica, 50(1), 1-25.
# NOT RUN {
infomat.test(x <- evd::rgpd(n = 1000), q = seq(0.9,0.995,length=10), K <- 3)
# }
Run the code above in your browser using DataLab