predExpectiles: Extreme Expectile Estimation

Description

Computes a point and interval estimate of the expectile at the extreme level (Expectile Prediction).

Usage

predExpectiles(data, tau, tau1, method="LAWS", tailest="Hill", var=FALSE,
               varType="asym-Dep", bias=FALSE, bigBlock=NULL, smallBlock=NULL,
               k=NULL, alpha_n=NULL, alpha=0.05)

Value

A list with elements:

EExpcHat: an estimate of the \(\tau'_n\)-th expecile;
VarExtHat: an estimate of the asymptotic variance of the expectile estimator;
CIExpct: an estimate of the approximate \((1-\alpha)100\%\) confidence interval for \(\tau'_n\)-th expecile.

Arguments

data: A vector of \((1 \times n)\) observations.
tau: A real in \((0,1)\) specifying the intermediate level \(\tau_n\). See Details.
tau1: A real in \((0,1)\) specifying the extreme level \(\tau'_n\). See Details.
method: A string specifying the method used to estimate the expecile. By default est="LAWS" specifies the use of the LAWS based estimator. See Details.
tailest: A string specifying the tail index estimator. By default tailest="Hill" specifies the use of Hill estimator. See Details.
var: If var=TRUE then an estimate of the asymptotic variance of the expectile estimator is computed.
varType: A string specifying the type of asymptotic variance to compute. By default varType="asym-Dep" specifies the variance estimator for serial dependent observations. See Details.
bias: A logical value. By default bias=FALSE specifies that no bias correction is computed. See Details.
bigBlock: An interger specifying the size of the big-block used to estimaste the asymptotic variance. See Details.
smallBlock: An interger specifying the size of the small-block used to estimaste the asymptotic variance. See Details.
k: An integer specifying the value of the intermediate sequence \(k_n\). See Details.
alpha_n: A real in \((0,1)\) specifying the quantile's extreme level to be use in order to estimate the expectile's extreme level.
alpha: A real in \((0,1)\) specifying the confidence level \((1-\alpha)100\%\) of the approximate confidence interval for the expecile at the intermedite level.

Author

Simone Padoan, simone.padoan@unibocconi.it, https://faculty.unibocconi.it/simonepadoan/; Gilles Stupfler, gilles.stupfler@univ-angers.fr, https://math.univ-angers.fr/~stupfler/

Details

For a dataset data of sample size \(n\), an estimate of the \(\tau'_n\)-th expectile is computed. The estimation of the expectile at the extreme level tau1 (\(\tau'_n\)) is meant to be a prediction beyond the observed sample. Two estimators are available: the so-called Least Asymmetrically Weighted Squares (LAWS) based estimator and the Quantile-Based (QB) estimator. The definition of both estimators depends on the estimation of the tail index \(\gamma\). Here, \(\gamma\) is estimated using the Hill estimation (see HTailIndex for details) or in alternative using the the expectile based estimator (see EBTailIndex). The observations can be either independent or temporal dependent. See Section 3.2 in Padoan and Stupfler (2020) for details.

The so-called intermediate level tau or \(\tau_n\) is a sequence of positive reals such that \(\tau_n \to 1\) as \(n \to \infty\). Practically, \(\tau_n \in (0,1)\) is the ratio between N (Numerator) and D (Denominator). Where N is the empirical mean distance of the \(\tau_n\)-th expectile from the observations smaller than it, and D is the empirical mean distance of \(\tau_n\)-th expectile from all the observations.
The so-called extreme level tau1 or \(\tau'_n\) is a sequence of positive reals such that \(\tau'_n \to 1\) as \(n \to \infty\). The value \((1-tau'_n) \in (0,1)\) is meant to be a small tail probability such that \((1-\tau'_n)=1/n\) or \((1-\tau'_n) < 1/n\). It is also assumed that \(n(1-\tau'_n) \to C\) as \(n \to \infty\), where \(C\) is a positive finite constant. Typically, \(C \in (0,1)\) so it is expected that there are no observations in a data sample that are greater than the expectile at the extreme level \(\tau_n'\).
When method='LAWS', then the \(\tau'_n\)-th expectile is estimated using the LAWS based estimator. When method='QB', the expectile is instead estimated using the QB esimtator. The definition of both estimators depend on the estimation of the tail index \(\gamma\). When tailest='Hill' then \(\gamma\) is estimated using the Hill estimator (see HTailIndex). When tailest='ExpBased', then \(\gamma\) is estimated using the expectile based estimator (see EBTailIndex). See Section 3.2 in Padoan and Stupfler (2020) for details.
If var=TRUE then an esitmate of the asymptotic variance of the \(tau'_n\)-th expectile is computed. Notice that the estimation of the asymptotic variance is only available when \(\gamma\) is estimated using the Hill estimator (see HTailIndex). With independent observations the asymptotic variance is estimated by \(\hat{\gamma}^2\), see the remark below Theorem 3.5 in Padoan and Stupfler (2020). This is achieved through varType="asym-Ind". With serial dependent observations the asymptotic variance is estimated by the formula in Throrem 3.5 of Padoan and Stupfler (2020). This is achieved through varType="asym-Dep". See Section 3.2 in Padoan and Stupfler (2020) for details. In this latter case the computation of the serial dependence is based on the "big blocks seperated by small blocks" techinque which is a standard tools in time series, see e.g. Leadbetter et al. (1986). The size of the big and small blocks are specified by the parameters bigBlock and smallBlock, respectively.
If bias=TRUE then \(\gamma\) is estimated using formula (4.2) of Haan et al. (2016). This is used by the LAWS and QB estimators. Furthermore, the \(\tau'_n\)--th quantile is estimated using the formula in page 330 of de Haan et al. (2016). This provides a bias corrected version of the Weissman estimator. This is used by the QB estimator. However, in this case the asymptotic variance is not estimated using the formula in Haan et al. (2016) Theorem 4.2. Instead, for simplicity the asymptotic variance is estimated by the formula in Corollary 3.8, with serial dependent observations, and \(\hat{\gamma}^2\) with independent observation (see e.g. de Drees 2000, for the details).
k or \(k_n\) is the value of the so-called intermediate sequence \(k_n\), \(n=1,2,\ldots\). Its represents a sequence of positive integers such that \(k_n \to \infty\) and \(k_n/n \to 0\) as \(n \to \infty\). Practically, when tau=NULL and method='LAWS', then \(\tau_n=1-k_n/n\) is the intermediate level of the expectile to be stimated. The latter is also used to estimate the tail index when tailest='ExpBased'. Instead, if tailest='Hill', then \(k_n\) specifies the number of k\(+1\) larger order statistics used in the definition of the Hill estimator. Differently, When tau=NULL and method='QB', then \(\tau_n=1-k_n/n\) is the intermediate level of the quantile to be stimated and of the expectile to be stimated when tailest='ExpBased'. Instead, when tailest='Hill' it is the numer of k\(+1\) larger order statistics used in the definition of the Hill estimator.
If quantile's extreme level is provided by alpha_n, then expectile's extreme level \(tau'_n(\alpha_n)\) is replaced by \(tau'_n(\alpha_n)\) which is esitmated using the method described in Section 6 of Padoan and Stupfler (2020). See estExtLevel for details.
Given a small value \(\alpha\in (0,1)\) then an estimate of an asymptotic confidence interval for \(tau'_n\)-th expectile, with approximate nominal confidence level \((1-\alpha)100\%\), is computed. The confidence intervals are computed exploiting formula (10) and (11) in Padoan and Stupfler (2020) and (46) in Drees (2003). See Section 5 in Padoan and Stupfler (2020) for details. When biast=TRUE confidence intervals are computed in the same way but after correcting the tail index estimate by an estimate of the bias term, see formula (4.2) in de Haan et al. (2016) for details.

References

Anthony C. Davison, Simone A. Padoan and Gilles Stupfler (2023). Tail Risk Inference via Expectiles in Heavy-Tailed Time Series, Journal of Business & Economic Statistics, 41(3) 876-889.

Daouia, A., Girard, S. and Stupfler, G. (2018). Estimation of tail risk based on extreme expectiles. Journal of the Royal Statistical Society: Series B, 80, 263-292.

de Haan, L., Mercadier, C. and Zhou, C. (2016). Adapting extreme value statistics tonancial time series: dealing with bias and serial dependence. Finance and Stochastics, 20, 321-354.

Drees, H. (2003). Extreme quantile estimation for dependent data, with applications to finance. Bernoulli, 9, 617-657.

Drees, H. (2000). Weighted approximations of tail processes for \(\beta\)-mixing random variables. Annals of Applied Probability, 10, 1274-1301.

Leadbetter, M.R., Lindgren, G. and Rootzen, H. (1989). Extremes and related properties of random sequences and processes. Springer.

Examples

Run this code

# Extreme expectile estimation at the extreme level tau1 obtained with
# 1-dimensional data simulated from an AR(1) with univariate
# Student-t distributed innovations

tsDist <- "studentT"
tsType <- "AR"

# parameter setting
corr <- 0.8
df <- 3
par <- c(corr, df)

# Big- small-blocks setting
bigBlock <- 65
smallBlock <- 15

# Intermediate level (or sample tail probability 1-tau)
tau <- 0.95
# Extreme level (or tail probability 1-tau1 of unobserved expectile)
tau1 <- 0.9995

# sample size
ndata <- 2500

# Simulates a sample from an AR(1) model with Student-t innovations
data <- rtimeseries(ndata, tsDist, tsType, par)

# Extreme expectile estimation
expectHat1 <- predExpectiles(data, tau, tau1, var=TRUE, bigBlock=bigBlock,
	                         smallBlock=smallBlock)
expectHat1$EExpcHat
expectHat1$CIExpct
# Extreme expectile estimation with bias correction
tau <- 0.80
expectHat2 <- predExpectiles(data, tau, tau1, "QB", var=TRUE, bias=TRUE, bigBlock=bigBlock,
							 smallBlock=smallBlock)
expectHat2$EExpcHat
expectHat2$CIExpct

Run the code above in your browser using DataLab