MixedContContIT: Fits (univariate) mixed-effect models to assess surrogacy in the continuous-continuous case based on the Information-Theoretic framework

Description

The function MixedContContIT uses the information-theoretic approach (Alonso & Molenberghs, 2007) to estimate trial- and individual-level surrogacy based on mixed-effect models when both S and T are continuous endpoints. The user can specify whether a (weighted or unweighted) full, semi-reduced, or reduced model should be fitted. See the Details section below.

Usage

MixedContContIT(Dataset, Surr, True, Treat, Trial.ID, Pat.ID,
Model=c("Full"), Weighted=TRUE, Min.Trial.Size=2, Alpha=.05, ...)

Arguments

Dataset

A data.frame that should consist of one line per patient. Each line contains (at least) a surrogate value, a true endpoint value, a treatment indicator, a patient ID, and a trial ID.

Surr

The name of the variable in Dataset that contains the surrogate endpoint values.

True

The name of the variable in Dataset that contains the true endpoint values.

Treat

The name of the variable in Dataset that contains the treatment indicators. The treatment indicator should either be coded as $1$ for the experimental group and $-1$ for the control group, or as $1$ for the experimental group and $0$ for the control group.

Trial.ID

The name of the variable in Dataset that contains the trial ID to which the patient belongs.

Pat.ID

The name of the variable in Dataset that contains the patient's ID.

Model

The type of model that should be fitted, i.e., Model=c("Full"), Model=c("Reduced"), or Model=c("SemiReduced"). See the Details section below. Default Model=c("Full").

Weighted

Logical. In practice it is often the case that different trials (or other clustering units) have different sample sizes. Univariate models are used to assess surrogacy in the information-theoretic approach, so it can be useful to adjust for heterogeneity in information content between the trial-specific contributions (particularly when trial-level surrogacy measures are of primary interest and when the heterogeneity in sample sizes is large). If Weighted=TRUE, weighted regression models are fitted. If Weighted=FALSE, unweighted regression analyses are conducted. See the Details section below. Default TRUE.

Min.Trial.Size

The minimum number of patients that a trial should contain to be included in the analysis. If the number of patients in a trial is smaller than the value specified by Min.Trial.Size, the data of the trial are excluded from the analysis. Default $2$.

Alpha

The $\alpha$-level that is used to determine the confidence intervals around $R^2_{h}$ and $R^2_{ht}$. Default $0.05$.

…

Other arguments to be passed to the function lmer (of the R package lme4) that is used to fit the geralized linear mixed-effect models in the function BimixedContCont.

Value

An object of class MixedContContIT with components,

Data.Analyze

Prior to conducting the surrogacy analysis, data of patients who have a missing value for the surrogate and/or the true endpoint are excluded. In addition, the data of trials (i) in which only one type of the treatment was administered, and (ii) in which either the surrogate or the true endpoint was a constant (i.e., all patients within a trial had the same surrogate and/or true endpoint value) are excluded. In addition, the user can specify the minimum number of patients that a trial should contain in order to include the trial in the analysis. If the number of patients in a trial is smaller than the value specified by Min.Trial.Size, the data of the trial are excluded. Data.Analyze is the dataset on which the surrogacy analysis was conducted.

Obs.Per.Trial

A data.frame that contains the total number of patients per trial and the number of patients who were administered the control treatment and the experimental treatment in each of the trials (in Data.Analyze).

Trial.Spec.Results

A data.frame that contains the trial-specific intercepts and treatment effects for the surrogate and the true endpoints (when a full or semi-reduced model is requested), or the trial-specific treatment effects for the surrogate and the true endpoints (when a reduced model is requested).

R2ht

A data.frame that contains the trial-level surrogacy estimate and its confidence interval.

R2h.ind

A data.frame that contains the individual-level surrogacy estimate and its confidence interval.

Cor.Endpoints

A data.frame that contains the correlations between the surrogate and the true endpoint in the control treatment group (i.e., $\rho_{T0S0}$) and in the experimental treatment group (i.e., $\rho_{T1S1}$), their standard errors and their confidence intervals.

Residuals

A data.frame that contains the residuals for the surrogate and true endpoints ($\varepsilon_{Sij}$ and $\varepsilon_{Tij}$) that are obtained when models (1) or models (2) are fitted (see the Details section above).

Details

Individual-level surrogacy

The following generalised linear mixed-effect models are fitted:

$$g_{T}(E(T_{ij}))=\mu_{T}+m_{Ti}+\beta Z_{ij}+b_{i}Z_{ij},$$ $$g_{T}(E(T_{ij}|S_{ij}))=\theta_{0}+c_{Ti}+\theta_{1}Z_{ij}+a_{i}Z_{ij}+\theta_{2i}S_{ij},$$

where $i$ and $j$ are the trial and subject indicators, $g_{T}$ is an appropriate link function (i.e., an identity link when a continuous true endpoint is considered), $S_{ij}$ and $T_{ij}$ are the surrogate and true endpoint values of subject $j$ in trial $i$, and $Z_{ij}$ is the treatment indicator for subject $j$ in trial $i$. $\mu_{T}$ and $\beta$ are a fixed intercept and a fixed treatment-effect on the true endpoint, while $m_{Ti}$ and $b_{i}$ are the corresponding random effects. $\theta_{0}$ and $\theta_{1}$ are the fixed intercept and the fixed treatment effect on the true endpoint after accounting for the effect of the surrogate endpoint, and $c_{Ti}$ and $a_i$ are the corresponding random effects.

The $-2$ log likelihood values of the previous models (i.e., $L_{1}$ and $L_{2}$, respectively) are subsequently used to compute individual-level surrogacy (based on the so-called Variance Reduction Factor, VFR; for details, see Alonso & Molenberghs, 2007):

$$R^2_{hind}= 1 - exp \left(-\frac{L_{2}-L_{1}}{N} \right),$$

where $N$ is the number of trials.

Trial-level surrogacy

When a full or semi-reduced model is requested (by using the argument Model=c("Full") or Model=c("SemiReduced") in the function call), trial-level surrogacy is assessed by fitting the following mixed models:

$$S_{ij}=\mu_{S}+m_{Si}+(\alpha+a_{i})Z_{ij}+\varepsilon_{Sij}, (1)$$ $$T_{ij}=\mu_{T}+m_{Ti}+(\beta+b_{i})Z_{ij}+\varepsilon_{Tij}, (1)$$

where $i$ and $j$ are the trial and subject indicators, $S_{ij}$ and $T_{ij}$ are the surrogate and true endpoint values of subject $j$ in trial $i$, $Z_{ij}$ is the treatment indicator for subject $j$ in trial $i$, $\mu_{S}$ and $\mu_{T}$ are the fixed intercepts for S and T, $m_{Si}$ and $m_{Ti}$ are the corresponding random intercepts, $\alpha$ and $\beta$ are the fixed treatment effects on S and T, and $a_{i}$ and $b_{i}$ are the corresponding random effects. The error terms $\varepsilon_{Sij}$ and $\varepsilon_{Tij}$ are assumed to be independent.

When a reduced model is requested by the user (by using the argument Model=c("Reduced") in the function call), the following univariate models are fitted:

$$S_{ij}=\mu_{S}+(\alpha+a_{i})Z_{ij}+\varepsilon_{Sij}, (2)$$ $$T_{ij}=\mu_{T}+(\beta+b_{i})Z_{ij}+\varepsilon_{Tij}, (2)$$

where $\mu_{S}$ and $\mu_{T}$ are the common intercepts for S and T. The other parameters are the same as defined above, and $\varepsilon_{Sij}$ and $\varepsilon_{Tij}$ are again assumed to be independent.

When the user requested that a full model approach is used (by using the argument Model=c("Full") in the function call, i.e., when models (1) were fitted), the following model is subsequently fitted:

$$\widehat{\beta}_{i}=\lambda_{0}+\lambda_{1}\widehat{\mu_{Si}}+\lambda_{2}\widehat{\alpha_i}+\varepsilon_{i}, (3)$$

where the parameter estimates for $\beta_i$, $\mu_{Si}$, and $\alpha_i$ are based on models (1) (see above). When a weighted model is requested (using the argument Weighted=TRUE in the function call), model (3) is a weighted regression model (with weights based on the number of observations in trial $i$). The $-2$ log likelihood value of the (weighted or unweighted) models (3) ($L_1$) is subsequently compared to the $-2$ log likelihood value of an intercept-only model ($\widehat{\beta}_{i}=\lambda_{3}$; $L_0$), and $R^2_{ht}$ is computed based on the Variance Reduction Factor (VFR; for details, see Alonso & Molenberghs, 2007):

$$R^2_{ht}= 1 - exp \left(-\frac{L_1-L_0}{N} \right),$$

where $N$ is the number of trials.

When a semi-reduced or reduced model is requested (by using the argument Model=c("SemiReduced") or Model=c("Reduced") in the function call), the following model is fitted:

$$\widehat{\beta_{i}}=\lambda_{0}+\lambda_{1}\widehat{\alpha_i}+\varepsilon_{i},$$

where the parameter estimates for $\beta_i$ and $\alpha_i$ are based on models (2). The $-2$ log likelihood value of this (weighted or unweighted) model ($L_1$) is subsequently compared to the $-2$ log likelihood value of an intercept-only model ($\widehat{\beta}_{i}=\lambda_{3}$; $L_0$), and $R^2_{ht}$ is computed based on the reduction in the likelihood (as described above).

References

Alonso, A, & Molenberghs, G. (2007). Surrogate marker evaluation from an information theory perspective. Biometrics, 63, 180-186.

Examples

Run this code

# NOT RUN {
# Example 1
# Based on the ARMD data:
data(ARMD)
# Assess surrogacy based on a full mixed-effect model
# in the information-theoretic framework:
Sur <- MixedContContIT(Dataset=ARMD, Surr=Diff24, True=Diff52, Treat=Treat, Trial.ID=Center,
Pat.ID=Id, Model="Full")
# Obtain a summary of the results:
summary(Sur)

# }
# NOT RUN {
 # Time consuming (>5sec) code part
# Example 2
# Conduct an analysis based on a simulated dataset with 2000 patients, 200 trials,
# and Rindiv=Rtrial=.8
# Simulate the data:
Sim.Data.MTS(N.Total=2000, N.Trial=200, R.Trial.Target=.8, R.Indiv.Target=.8,
Seed=123, Model="Full")
# Assess surrogacy based on a full mixed-effect model
# in the information-theoretic framework:
Sur2 <- MixedContContIT(Dataset=Data.Observed.MTS, Surr=Surr, True=True, Treat=Treat,
Trial.ID=Trial.ID, Pat.ID=Pat.ID, Model="Full")

# Show a summary of the results:
summary(Sur2)
# }

Run the code above in your browser using DataLab