gofKendallCvM: gof test (Cramer-von Mises) based on Kendall's process

Description

gofKendallCvM tests whether a given dataset follows a specified copula, using Kendall's process with the Cramer-von Mises test statistic. The margins can be estimated with one of several parametric distributions, and an estimate of the time the computation will take can be reported. The possible copulae are "normal", "t", "gumbel", "clayton" and "frank". See Genest et al. (2009) for reference. The parameters are estimated with the pseudo maximum likelihood method. If this estimation fails, inversion of Kendall's tau is used. The approximate p-values are computed with a parametric bootstrap, which can be accelerated by enabling the built-in parallel computation.
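The fallback by inversion of Kendall's tau can be illustrated with a minimal base R sketch. It is not the package's internal code: the helper name tau_to_param and the toy data are hypothetical, and only the well-known closed-form relations for the bivariate normal, Clayton and Gumbel families are covered (the Frank copula has no closed-form inverse and is left out).

```r
# Hypothetical helper: map Kendall's tau to the copula parameter
# using the standard closed-form relations for bivariate families.
tau_to_param <- function(tau, family) {
  switch(family,
    normal  = sin(pi * tau / 2),   # correlation parameter rho
    clayton = 2 * tau / (1 - tau),
    gumbel  = 1 / (1 - tau),
    stop("family not covered by this sketch"))
}

# Toy data (illustrative values only).
x <- c(0.2, 1.5, -0.3, 0.9, 2.1)
y <- c(1.1, 0.4, -0.8, 1.9, 2.5)

# Sample version of Kendall's tau from base R.
tau_hat <- cor(x, y, method = "kendall")

theta_clayton <- tau_to_param(tau_hat, "clayton")
theta_gumbel  <- tau_to_param(tau_hat, "gumbel")
```

For these toy values the sample tau is 0.6, which corresponds to a Clayton parameter of 3 and a Gumbel parameter of 2.5.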

Usage

gofKendallCvM(copula, x, param = 0.5, param.est = T, df = 4, df.est = T, 
              margins = "ranks", dispstr = "ex", M = 100, 
              execute.times.comp = T, processes = 1)

Arguments

copula

The copula to test for. Possible values are "normal", "t", "clayton", "gumbel" and "frank".

x

A matrix containing the residuals of the data.

param

The copula parameter to use, if it shall not be estimated.

param.est

Shall be either TRUE or FALSE. TRUE means that param will be estimated.

df

Degrees of freedom, if not meant to be estimated. Only necessary when testing for the "t" copula.

df.est

Indicates whether df shall be estimated. Has to be either FALSE or TRUE, where TRUE means that it will be estimated.

margins

Specifies which estimation method shall be used if the input data are not in the range [0,1]. The default is "ranks", which is the standard approach to convert data in such a case. Alternatively, the following distributions can be specified: "beta", "cauchy", Chi-squared ("chisq"), "f", "gamma", Log normal ("lnorm"), Normal ("norm"), "t", "weibull", Exponential ("exp").

dispstr

A character string specifying the type of the symmetric positive definite matrix characterizing the elliptical copula. Implemented structures are "ex" for exchangeable and "un" for unstructured, see package copula.

M

Number of bootstrap samples.

execute.times.comp

Logical. Defines whether the time which the estimation most likely takes shall be computed. It is only reported if M is at least 100.

processes

The number of parallel processes which are performed to speed up the bootstrapping. Shouldn't be higher than the number of logical processors. Please see the details.

Value

An object of the class gofCOP with the components

method

a character which informs about the performed analysis

erg.tests

a matrix with the p-value and test statistic of the test

Details

With the pseudo observations $U_{ij}$ for $i = 1, \dots,n$, $j = 1, \dots,d$ and $\mathbf{u} \in [0,1]^d$, the empirical copula is given by $C_n(\mathbf{u}) = \frac{1}{n} \sum_{i = 1}^n \mathbf{I}(U_{i1} \leq u_1, \dots, U_{id} \leq u_d).$ Let the rescaled pseudo observations be $V_1 = C_n(\mathbf{U}_1), \dots, V_n = C_n(\mathbf{U}_n)$, where $\mathbf{U}_i = (U_{i1}, \dots, U_{id})$, and let $K$ denote the distribution function of $V$. Its estimated version is given by $$K_n(v) = \frac{1}{n} \sum_{i=1}^n \mathbf{I}(V_i \leq v)$$ with $v \in [0,1].$ The testable hypothesis $H_0^{'}$ is then $$K \in \mathcal{K}_0 = \{K_{\theta} : \theta \in \Theta \}$$ with $\Theta$ being an open subset of $R^p$ for an integer $p \geq 1$, see Genest et al. (2009). The resulting Cramer-von Mises test statistic is then given by $$T = n \int_0^1 (K_n(v) - K_{\theta_n}(v))^2 d K_{\theta_n}(v).$$
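The quantities $C_n$, $V_i$ and $K_n$ can be traced with a minimal base R sketch. It is an illustration under stated assumptions, not the package's implementation: the toy data, the helper names emp_copula and K_n, and the scaling of the ranks by $n + 1$ are all assumptions made for this example.

```r
# Toy data: n = 5 observations in d = 2 dimensions (illustrative values only).
x <- cbind(c(0.2, 1.5, -0.3, 0.9, 2.1),
           c(1.1, 0.4, -0.8, 1.9, 2.5))
n <- nrow(x)

# Pseudo observations U_ij: column-wise ranks scaled by n + 1 (assumed scaling).
U <- apply(x, 2, rank) / (n + 1)

# Empirical copula C_n evaluated at a point u in [0,1]^d:
# share of rows of U that are componentwise <= u.
emp_copula <- function(u, U) {
  mean(apply(U, 1, function(row) all(row <= u)))
}

# Rescaled pseudo observations V_i = C_n(U_i).
V <- apply(U, 1, emp_copula, U = U)

# Empirical Kendall distribution K_n(v) for a scalar v in [0,1].
K_n <- function(v) mean(V <= v)
```

For the toy data, V equals (0.4, 0.4, 0.2, 0.6, 1) and K_n(0.5) equals 0.6. The test statistic additionally requires the parametric $K_{\theta_n}$ of the hypothesized copula family, which this sketch does not cover.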

Because $H_0^{'}$ comprises more distributions than $H_0$, the test is not necessarily consistent.

The approximate p-value is computed by the formula

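$$\sum_{b=1}^M \mathbf{I}(|T_b| \geq |T|) / M,$$

where $T$ is the test statistic and $T_b$ is the test statistic computed from the $b$-th bootstrap sample.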
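As a small worked instance of the p-value formula, the following base R sketch uses made-up numbers. The variable names T_obs and T_boot and all values are hypothetical and do not come from the package.

```r
# Hypothetical values: observed statistic and M = 8 bootstrap statistics.
T_obs  <- 0.12
T_boot <- c(0.05, 0.20, 0.11, 0.30, 0.08, 0.15, 0.02, 0.09)

# Share of bootstrap statistics whose absolute value is at least |T_obs|.
p_value <- mean(abs(T_boot) >= abs(T_obs))
```

Here three of the eight bootstrap statistics reach or exceed the observed value, so the approximate p-value is 3/8 = 0.375.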

For small values of M, initializing the parallelization via processes does not make sense, because registering the parallel processes increases the computation time. Please consider enabling parallelization only for high values of M.

References

Christian Genest, Bruno Remillard, David Beaudoin (2009). Goodness-of-fit tests for copulas: A review and a power study. Insurance: Mathematics and Economics, Volume 44, Issue 2, April 2009, Pages 199-213, ISSN 0167-6687. http://dx.doi.org/10.1016/j.insmatheco.2007.10.005

Christian Genest, Jean-Francois Quessy, Bruno Remillard (2002). Tests of serial independence based on Kendall's process. The Canadian Journal of Statistics, Volume 30, Issue 3, Sep. 2002, Pages 441-461. https://www.jstor.org/stable/3316147

cp. Ulf Schepsmeier, Jakob Stoeber, Eike Christian Brechmann, Benedikt Graeler (2015). VineCopula: Statistical Inference of Vine Copulas. R package version 1.4. https://cran.r-project.org/package=VineCopula

Examples


# NOT RUN {
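# Load the package that provides gofKendallCvM and the IndexReturns data
library(gofCopula)
data(IndexReturns)

# Test the first 100 observations of the first two series for a normal copula
gofKendallCvM("normal", IndexReturns[1:100, 1:2], M = 10)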
# }
