Disake (version 1.5)

Disake-package: Discrete associated kernel estimators

Description

Discrete smoothing of a probability mass function (p.m.f.) is performed using three discrete associated kernels: DiracDU, Binomial and Discrete Triangular. Two automatic bandwidth selection procedures are implemented: the cross-validation method for all three kernels and the local Bayesian approach for the Binomial kernel. Note that the DiracDU kernel is used for categorical data, the Binomial kernel is appropriate for count data with small or moderate sample sizes, and the Discrete Triangular kernel is recommended for count data with large sample sizes.
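
As a point of reference, the discrete associated kernel estimator of a p.m.f. $f$ is usually written as below. The notation is assumed here and does not come from this page: $X_1, \ldots, X_n$ is the observed sample, $K_{x,h}$ is a generic discrete associated kernel with target $x$ and bandwidth $h$, and $\hat{f}_n$ is the resulting estimate.

$$\hat{f}_n(x) = \frac{1}{n}\sum_{i=1}^{n} K_{x,h}(X_i).$$

Each of the three kernels named above can play the role of $K_{x,h}$.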

  • Binomial kernel: Let $x\in N := \{0, 1, \ldots\}$ and $S_x = \{0, 1, \ldots, x + 1\}$. The Binomial kernel is defined on the support $S_x$ by

    $$B_{x,h}(y) = \frac{(x+1)!}{y!\,(x+1-y)!}\left(\frac{x+h}{x+1}\right)^{y}\left(\frac{1-h}{x+1}\right)^{x+1-y}1_{S_x}(y),$$

    where $h\in(0, 1]$ and $1_A$ denotes the indicator function of the set $A$. Note that $B_{x,h}$ is the p.m.f. of the Binomial distribution with $x+1$ trials and success probability $(x+h)/(x+1)$. See Kokonendji and Senga Kiessé (2011).

  • DiracDU kernel: For a fixed number of categories $c\in \{2, 3, \ldots\}$, we define $S_c = \{0, 1, \ldots, c-1\}$. The DiracDU kernel is defined on $S_c$ by $$DU_{x,h;c}(y) = (1 - h)1_{\{x\}}(y)+\frac{h}{c-1}1_{S_c\setminus\{x\}}(y),$$ where $x\in S_c$ and $h\in(0, 1]$. See Kokonendji and Senga Kiessé (2011), and also Aitchison and Aitken (1976) for the multivariate case.
  • Discrete Triangular kernel: For a fixed arm $a\in N$, we define $S_{x,a} = \{x-a, \ldots, x, \ldots, x + a\}$. The Discrete Triangular kernel is defined on $S_{x,a}$ by $$DT_{x,h;a}(y) = \frac{(a+1)^h - |y-x|^h}{P(a,h)}1_{S_{x,a}}(y),$$ where $x\in N$, $h>0$ and $P(a,h)=(2a+1)(a+1)^h - 2(1+2^h+ \cdots +a^h)$ is the normalizing constant. For $a=0$, the Discrete Triangular kernel $DT_{x,h;0}$ corresponds to the Dirac kernel on $x$; see Kokonendji et al. (2007), and also Kokonendji and Zocchi (2010) for an asymmetric version of the Discrete Triangular kernel.
  • Bandwidth selection: Two functions are implemented to select the bandwidth: CVbw (cross-validation) and Baysbw (local Bayesian procedure). The cross-validation technique CVbw is used for the DiracDU, Binomial and Discrete Triangular kernels; see Kokonendji and Senga Kiessé (2011). The local Bayesian procedure Baysbw selects the bandwidth for the Binomial kernel only; see Zougab et al. (2012).
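
The three kernel definitions above translate almost line for line into code. The following Python sketch is independent of the package: the function names (`binomial_kernel`, `diracdu_kernel`, `discrete_triangular_kernel`, `pmf_estimate`) and their argument order are illustrative assumptions, not the signatures of kf, CVbw or Baysbw. It evaluates each kernel at a point y and, assuming the usual discrete associated kernel estimator, averages the kernel values over a sample.

```python
from math import comb


def binomial_kernel(x, h, y):
    """B_{x,h}(y): Binomial(x + 1, (x + h)/(x + 1)) p.m.f., zero off {0, ..., x + 1}."""
    n = x + 1
    if not 0 <= y <= n:
        return 0.0
    p = (x + h) / n  # success probability; 1 - p equals (1 - h)/(x + 1)
    return comb(n, y) * p**y * (1 - p) ** (n - y)


def diracdu_kernel(x, h, y, c):
    """DU_{x,h;c}(y) on {0, ..., c - 1}: mass 1 - h at x, h/(c - 1) elsewhere."""
    if not 0 <= y <= c - 1:
        return 0.0
    return 1 - h if y == x else h / (c - 1)


def discrete_triangular_kernel(x, h, y, a):
    """DT_{x,h;a}(y) on {x - a, ..., x + a} with normalizing constant P(a, h)."""
    if abs(y - x) > a:
        return 0.0
    norm = (2 * a + 1) * (a + 1) ** h - 2 * sum(k**h for k in range(1, a + 1))
    return ((a + 1) ** h - abs(y - x) ** h) / norm


def pmf_estimate(x, data, h, kernel, **kernel_args):
    """Average of K_{x,h}(X_i) over the sample (assumed estimator form)."""
    return sum(kernel(x, h, obs, **kernel_args) for obs in data) / len(data)


# Made-up counts, for illustration only.
data = [0, 1, 1, 2, 4]
estimate = [pmf_estimate(x, data, 0.1, binomial_kernel) for x in range(6)]
```

Each kernel sums to one over its own support in y. The estimate over x, however, need not sum exactly to one with the Binomial kernel, so a final renormalization may be needed; whether and how the package does this is not stated on this page.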

References

Aitchison, J. and Aitken, C.G.G. (1976). Multivariate binary discrimination by the kernel method, Biometrika 63, 413 - 420.

Kokonendji, C.C. and Senga Kiessé, T. (2011). Discrete associated kernel method and extensions, Statistical Methodology 8, 497 - 516.

Kokonendji, C.C., Senga Kiessé, T. and Zocchi, S.S. (2007). Discrete triangular distributions and non-parametric estimation for probability mass function, Journal of Nonparametric Statistics 19, 241 - 254.

Kokonendji, C.C. and Zocchi, S.S. (2010). Extensions of discrete triangular distribution and boundary bias in kernel estimation for discrete functions, Statistics and Probability Letters 80, 1655 - 1662.

Zougab, N., Adjabi, S. and Kokonendji, C.C. (2012). Binomial kernel and Bayes local bandwidth in discrete functions estimation, Journal of Nonparametric Statistics 24, 783 - 795.