clmm: Cumulative link mixed models

Description

Fits cumulative link mixed models, i.e. cumulative link models with random effects via the Laplace approximation or the standard and the adaptive Gauss-Hermite quadrature approximation. The functionality in clm is also implemented here. Currently only a single random term is allowed in the location-part of the model.

Usage

clmm(location, scale, nominal, random, data, weights, start, subset,
     na.action, contrasts, Hess = FALSE, model = TRUE, sdFixed,
     link = c("logistic", "probit", "cloglog", "loglog",
     "cauchit", "Aranda-Ordaz", "log-gamma"), lambda,
     doFit = TRUE, control, nAGQ = 1,
     threshold = c("flexible", "symmetric", "equidistant"), ...)

Arguments

location

as in clm.

scale

as in clm.

nominal

as in clm.

random

a factor for the random effects in the location-part of the model.

data

as in clm.

weights

as in clm.

start

initial values for the parameters in the format c(alpha, beta, log(zeta), lambda, log(stDev)) where stDev is the standard deviation of the random effects.

subset

as in clm.

na.action

as in clm.

contrasts

as in clm.

Hess

logical for whether the Hessian (the inverse of the observed information matrix) should be computed. Use Hess = TRUE if you intend to call summary or vcov on the fit and Hess = FALSE in all othe

model

as in clm.

sdFixed

If sdFixed is specified (a positive scalar), a model is fitted where the standard deviation for the random term is fixed at the value of sdFixed. If sdFixed is left unspecified, the standard deviation of the

link

as in clm.

lambda

as in clm.

doFit

as in clm although it can also be one of

c("no",
    "R" "C")

, where "R" use the R-implementation for fitting, "C" (default) use C-implementation for fitting and

control

a call to clmm.control.

threshold

as in clm.

nAGQ

the number of quadrature points to be used in the adaptive Gauss-Hermite quadrature approximation to the marginal likelihood. Defaults to 1 which leads to the Laplace approximation. An odd number of quadrature points is encoura

...

additional arguments are passed on to clm.control and possibly further on to the optimizer, which can lead to surprising error or warning messages when mistyping arguments etc.

Value

If doFit = FALSE the result is an environment representing the model ready to be optimized. If doFit = TRUE the result is an object of class "clmm" with the following components:
stDevthe standard deviation of the random effects.
Niterthe total number of iterations in the Newton updates of the conditional modes of the random effects.
grFacthe grouping factor defining the random effects.
nAGQthe number of quadrature points used in the adaptive Gauss-Hermite Quadrature approximation to the marginal likelihood.
ranefthe conditional modes of the random effects, sometimes referred to as "random effect estimates".
condVarthe conditional variances of the random effects at their conditional modes.
betathe parameter estimates of the location part.
zetathe parameter estimates of the scale part on the log scale; the scale parameter estimates on the original scale are given by exp(zeta).
Alphavector or matrix of the threshold parameters.
Thetavector or matrix of the thresholds.
xivector of threshold parameters, which, given a threshold function (e.g. "equidistant"), and possible nominal effects define the class boundaries, Theta.
lambdathe value of lambda if lambda is supplied or estimated, otherwise missing.
coefficientsthe coefficients of the intercepts (theta), the location (beta), the scale (zeta), and the link function parameter (lambda).
df.residualthe number of residual degrees of freedoms, calculated using the weights.
fitted.valuesvector of fitted values conditional on the values of the random effects. Use predict to get the fitted values for a random effect of zero. An observation here is taken to be each of the scalar elements of the multinomial table and not a multinomial vector.
convergenceTRUE if the optimizer terminates wihtout error and FALSE otherwise.
gradientvector of gradients for the unit-variance random effects at their conditional modes.
optReslist with results from the optimizer. The contents of the list depends on the choice of optimizer.
logLikthe log likelihood of the model at optimizer termination.
Hessianif the model was fitted with Hess = TRUE, this is the Hessian matrix of the parameters at the optimum.
scalemodel.frame for the scale model.
locationmodel.frame for the location model.
nominalmodel.frame for the nominal model.
edfthe (effective) number of degrees of freedom used by the model.
startthe starting values.
methodcharacter, the optimizer.
ythe response variable.
levthe names of the levels of the response variable.
nobsthe (effective) number of observations, calculated as the sum of the weights.
thresholdcharacter, the threshold function used in the model.
estimLambda1 if lambda is estimated in one of the flexible link functions and 0 otherwise.
linkcharacter, the link function used in the model.
callthe matched call.
contrastscontrasts applied to terms in location and scale models.
na.actionthe function used to filter missing data.

Details

There are methods for the standard model-fitting functions, including summary, vcov, profile, plot.profile, confint, anova, logLik, predict and an extractAIC method. A Newton scheme is used to obtain the conditional modes of the random effects for Laplace and AGQ approximations, and a non-linear optimization is performed over the fixed parameter set to get the maximum likelihood estimates. The Newton scheme uses the observed Hessian rather than the expected as is done in e.g. glmer, so results from the Laplace approximation for binomial fits should in general be more precise - particularly for other links than the "logistic". Core parts of the function are implemented in C-code for speed. For binomial fits with a single random factor clmm is now faster than glmer for at least some models. The function calls clm to up an environment and to get starting values.

References

Agresti, A. (2002) Categorical Data Analysis. Second edition. Wiley.

Examples

Run this code

options(contrasts = c("contr.treatment", "contr.poly"))
data(soup)

## More manageable data set:
dat <- subset(soup, as.numeric(as.character(RESP)) <=  24)
dat$RESP <- dat$RESP[drop=TRUE]

m1 <- clmm(SURENESS ~ PROD, random = RESP, data = dat, link="probit",
           Hess = TRUE, method="ucminf", threshold = "symmetric")

m1
summary(m1)
logLik(m1)
vcov(m1)
extractAIC(m1)
anova(m1, update(m1, location = SURENESS ~ 1, Hess = FALSE))
anova(m1, update(m1, random = NULL))

## Use adaptive Gauss-Hermite quadrature rather than the Laplace
## approximation:
update(m1, Hess = FALSE, nAGQ = 3)

## Use standard Gauss-Hermite quadrature:
update(m1, Hess = FALSE, nAGQ = -7)

##################################################################
## Binomial example with data from the lme4-package:
data(cbpp, package = "lme4")
cbpp2 <- rbind(cbpp[,-(2:3)], cbpp[,-(2:3)])
cbpp2 <- within(cbpp2, {
    incidence <- as.factor(rep(0:1, each=nrow(cbpp)))
    freq <- with(cbpp, c(incidence, size - incidence))
})

## Fit with Laplace approximation:
fm1 <- clmm(incidence ~ period, random = herd, weights = freq,
            data = cbpp2, Hess = 1)
summary(fm1)

## Fit with the adaptive Gauss-Hermite quadrature approximation:
fm2 <- clmm(incidence ~ period, random = herd, weights = freq,
            data = cbpp2, Hess = 1, nAGQ = 7)
summary(fm2)

Run the code above in your browser using DataLab