Confidence interval formulae for p.conf
calculates confidence intervals for the binomial parameter data
vector. Finite population corrections can also be specified.
ci.p(data, conf = 0.95, summarized = FALSE, phat = NULL,
fpc = FALSE, n = NULL, N = NULL, method="agresti.coull", plot = TRUE)
A vector of binary data. Required if summarized = FALSE
.
Level of confidence 1 - P(type I error).
Logical; indicate whether raw data or summary stats are to be used.
Estimate of summarized = TRUE
.
Logical. Indicates whether finite population corrections should be used. If fpc = TRUE
then N
must be specified. Finite population corrections are not possible for method = "exact"
or method = "score"
.
Sample size. Required if summarized = TRUE
.
Population size. Required if fpc = TRUE
.
Type of method to be used in confidence interval calculations, method ="agresti.coull"
is the default. Other procedures include method="asymptotic"
which provides the conventional normal (Wald) approximation,
method = "score"
, method = "LR"
, and method="exact"
(see Details below). Partial names can be used. The "exact"
method cannot be implemented if summarized=TRUE
.
Logical. Should likelihood ratio plot be created with estimate from method = "LR"
.
Returns a list of class = "ci"
.
Estimate for
Estimate for
Confidence margin.
Confidence interval.
For the binomial distribution, the parameter of interest is the probability of success,
Because the sampling distribution of any ML estimator is asymptotically normal, an "asymptotic" 100(1 -
This method has also been called the Wald confidence interval.
These estimators can create extremely inaccurate confidence intervals, particularly for small sample sizes or when
where
As above, the 100(1 -
The likelihood ratio method method = "LR"
finds points in the binomial log-likelihood function where the difference between the maximum likelihood and likelihood function is closest to seq(0.00001, 0.99999, by = 0.00001)
.
The "exact" method of Clopper and Pearson (1934) is bounded at the nominal limits, but actual coverage may be well below this level, particularly when n is small and
Agresti (2012) recommends the Agresti-Coull method over the normal approximation, the score method over the Agresti-Coull method, and the likelihood ratio method over all others. The Clopper Pearson has been repeatedly criticized as being too conservative (Agresti and Coull 2012).
Agresti, A. (2012) Categorical Data Analysis, 3rd edition. New York. Wiley.
Agresti, A., and Coull, B . A. (1998) Approximate is better than 'exact' for interval estimation of binomial proportions. The American Statistician. 52: 119-126.
Clopper, C. and Pearson, S. (1934) The use of confidence or fiducial limits illustrated in the case of the Binomial. Biometrika 26: 404-413.
Ott, R. L., and Longnecker, M. T. (2004) A First Course in Statistical Methods. Thompson.
Wilson, E. B.(1927) Probable inference, the law of succession, and statistical inference. Journal of the American Statistical Association 22: 209-212.
# NOT RUN {
#In 2001, it was estimated that 56,200 Americans would be diagnosed with
# non-Hodgkin's lymphoma and that 26,300 would die from it (Cernan et al. 2002).
# Here we find the 95% confidence interval for the probability of diagnosis, pi.
ci.p(c(rep(0, 56200-26300),rep(1,26300))) # Agresti-Coull
ci.p(c(rep(0, 56200-26300),rep(1,26300)), method = "LR") # Likelihood Ratio
# summarized = TRUE
n = 56200
x = 26300
phat = x/n
ci.p(summarized = TRUE, phat = phat, n = n) # Agresti-Coull
# Use 2001 US population size as N
N <- 285 * 10^6
ci.p(c(rep(0, 56200-26300),rep(1,26300)), fpc = TRUE, N = N) # Agresti-Coull
ci.p(summarized = TRUE, phat = phat, n = n, N = N, fpc = TRUE) # Agresti-Coull
# }
Run the code above in your browser using DataLab