fitgrouped: Estimating parameters of the Weibull distribution fitted to grouped data

Description

Suppose a sample of $n$ independent observations each follows a three-parameter Weibull distribution have been divided into $m$ separate groups of the form $(r_{i-1},r_i]$, for $i=1,\dots,m$. So, the likelihood function is given by $$ L(\Theta)=\frac{n!}{f_{1}!f_{2}!\dots f_{m}!}\prod_{i=1}^{m}\Bigl[F\bigl(r_{i}\big|\Theta\bigr)-F\bigl(r_{i-1}\big|\Theta\bigr)\Bigr]^{f_i},$$ where the $r_0$ is the lower bound of the first group, $r_m$ is the upper bound of the last group, and $f_i$ is the frequency of observations within $i$-th group provided that $n=\sum_{i=1}^{m}f_{i}$. The cdf of a three-parameter Birnbaum-Saunders or Weibull distribution is given by $F(x;\Theta)=1- \exp \Bigl\{-\left(\frac{x-\mu}{\beta } \right)^{\alpha } \Bigr\}$ where $\Theta=(\alpha,\beta,\mu)^T$.

Usage

fitgrouped(r, f, family, method1, starts, method2)

Arguments

A numeric vector of length $m+1$. The first element of $r$ is lower bound of the first group and other $m$ elements are upper bound of the $m$ groups. We note that upper bound of the $(i-1)$-th group is the lower bound of the $i$-th group, for $i=2,\dots,m$. The lower bound of the first group and upper bound of the $m$-th group are chosen arbitrarily.

A numeric vector of length $m$ containing the group's frequency.

family

Can be either "weibull" or "birnbaum-saunders".

method1

A character string determining the method of estimation. It can be one of "aml", "em" and "ml". The short forms "aml", "em", and "ml" are described as follows.

starts

A numeric vector of the initial values for the shape, scale, and location parameters, respectively.

method2

The method for optimizing the log-likelihood function. It invovles one of "BFGS", "Nelder-Mead", "CG", "L-BFGS-B" or "SANN".

Value

A two-part list of objects given by the following:

Estimated parameters of the three-parameter Birnbaum-Saunders or Weibull distribution fitted to the gropued data.
A sequence of goodness-of-fit measures consist of Akaike Information Criterion (AIC), Consistent Akaike Information Criterion (CAIC), Bayesian Information Criterion (BIC), Hannan-Quinn information criterion (HQIC), Anderson-Darling (AD), Chi-square (Chi-square), Cram\'eer-von Misses (CVM), Kolmogorov-Smirnov (KS), and log-likelihood (log-likelihood) statistics.

Details

If the method is "em", then the initial values ("starts") and the log-likelihood optimizing method ("method2") are ignored.

References

G. J. McLachlan and T. Krishnan, 2007. The EM Algorithm and Extensions, John Wiley & Sons.

A. P. Dempster, N. M. Laird, and D. B. Rubin, 1977. Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society, Series B (methodological), 1-38.

M. Teimouri and A. K. Gupta, 2012. Estimation Methods for the Gompertz<U+2013>Makeham Distribution Under Progressively Type-I Interval Censoring Scheme, National Academy Science Letters, 35(3).

Examples

Run this code

# NOT RUN {
r<-c(0,1,2,3,4,10)
f<-c(2,8,12,15,4)
starts<-c(2,2,0)
fitgrouped(r,f,"birnbaum-saunders","em")
fitgrouped(r,f,"weibull","ml",starts,"CG")
# }

Run the code above in your browser using DataLab