Testing the equality of the survival distributions in two or more independent groups.
# S3 method for formula
logrank_test(formula, data, subset = NULL, weights = NULL, …)
# S3 method for IndependenceProblem
logrank_test(object, ties.method = c("mid-ranks", "Hothorn-Lausen",
"average-scores"),
type = c("logrank", "Gehan-Breslow", "Tarone-Ware", "Prentice",
"Prentice-Marek", "Andersen-Borgan-Gill-Keiding",
"Fleming-Harrington", "Gaugler-Kim-Liao", "Self"),
rho = NULL, gamma = NULL, …)
a formula of the form y ~ x | block
where y
is a survival
object (see Surv
in package survival),
x
is a factor and block
is an optional factor for
stratification.
an optional data frame containing the variables in the model formula.
an optional vector specifying a subset of observations to be used. Defaults
to NULL
.
an optional formula of the form ~ w
defining integer valued case
weights for each observation. Defaults to NULL
, implying equal
weight for all observations.
a character, the method used to handle ties: the score generating function
either uses mid-ranks ("mid-ranks"
, default), the Hothorn-Lausen
method ("Hothorn-Lausen"
) or averages the scores of randomly broken
ties ("average-scores"
); see ‘Details’.
a character, the type of test: either "logrank"
(default),
"Gehan-Breslow"
, "Tarone-Ware"
, "Prentice"
,
"Prentice-Marek"
, "Andersen-Borgan-Gill-Keiding"
,
"Fleming-Harrington"
, "Gaugler-Kim-Liao"
or "Self"
; see
‘Details’.
a numeric, the type
is "Tarone-Ware"
,
"Fleming-Harrington"
, "Gaugler-Kim-Liao"
or "Self"
; see
‘Details’. Defaults to NULL
, implying 0.5
for
type = "Tarone-Ware"
and 0
otherwise.
a numeric, the type
is
"Fleming-Harrington"
, "Gaugler-Kim-Liao"
or "Self"
; see
‘Details’. Defaults to NULL
, implying 0
.
further arguments to be passed to independence_test
.
logrank_test
provides the weighted logrank test reformulated as a
linear rank test. The family of weighted logrank tests encompasses a large
collection of tests commonly used in the analysis of survival data including,
but not limited to, the standard (unweighted) logrank test, the Gehan-Breslow
test, the Tarone-Ware class of tests, the Prentice test, the Prentice-Marek
test, the Andersen-Borgan-Gill-Keiding test, the Fleming-Harrington class of
tests and the Self class of tests. A general description of these methods is
given by Klein and Moeschberger (2003, Ch. 7). See Let<U+00F3>n and
Zuluaga (2001) for the linear rank test formulation.
The null hypothesis of equality, or conditional equality given block
,
of the survival distribution of y
in the groups defined by x
is
tested. In the two-sample case, the two-sided null hypothesis is alternative = "less"
, the null hypothesis is alternative = "greater"
, the null hypothesis is
If x
is an ordered factor, the default scores, 1:nlevels(x)
, can
be altered using the scores
argument (see
independence_test
); this argument can also be used to coerce
nominal factors to class "ordered"
. In this case, a linear-by-linear
association test is computed and the direction of the alternative hypothesis
can be specified using the alternative
argument. This type of
extension of the standard logrank test was given by Tarone (1975) and later
generalized to general weights by Tarone and Ware (1977).
Let
Three different methods of handling ties are available using
ties.method
: mid-ranks ("mid-ranks"
, default), the
Hothorn-Lausen method ("Hothorn-Lausen"
) and average-scores
("average-scores"
). The first and last method are discussed and
contrasted by Callaert (2003), whereas the second method is defined in Hothorn
and Lausen (2003). The mid-ranks method leads to
The type
argument allows for a choice between some of the most
well-known members of the family of weighted logrank tests, each corresponding
to a particular weight function. The standard logrank test ("logrank"
,
default) was suggested by Mantel (1966), Peto and Peto (1972) and Cox (1972)
and has "Gehan-Breslow"
)
proposed by Gehan (1965) and later extended to "Tarone-Ware"
) discussed by
Tarone and Ware (1977) has "Prentice"
) is another generalization of the Wilcoxon rank-sum test
proposed by Prentice (1978), where
"Prentice-Marek"
) is yet another
generalization of the Wilcoxon rank-sum test discussed by Prentice and Marek
(1979), with
"Andersen-Borgan-Gill-Keiding"
)
suggested by Andersen et al. (1982) is a modified version of the
Prentice-Marek test using
"Fleming-Harrington"
) proposed by
Fleming and Harrington (1991) uses "Gaugler-Kim-Liao"
) discussed by
Gaugler et al. (2007) is a modified version of the Fleming-Harrington
class of tests, replacing "Self"
)
suggested by Self (1991) has
The conditional null distribution of the test statistic is used to obtain
distribution = "asymptotic"
). Alternatively, the
distribution can be approximated via Monte Carlo resampling or computed
exactly for univariate two-sample problems by setting distribution
to
"approximate"
or "exact"
respectively. See
asymptotic
, approximate
and exact
for details.
Andersen, P. K., Borgan, <U+00D8>., Gill, R. and Keiding, N. (1982). Linear nonparametric tests for comparison of counting processes, with applications to censored survival data (with discussion). International Statistical Review 50(3), 219--258.
Breslow, N. (1970). A generalized Kruskal-Wallis test for comparing
Callaert, H. (2003). Comparing statistical software packages: The case of the logrank test in StatXact. The American Statistician 57(3), 214--217.
Cox, D. R. (1972). Regression models and life-tables (with discussion). Journal of the Royal Statistical Society B 34(2), 187--220.
Fleming, T. R. and Harrington, D. P. (1991). Counting Processes and Survival Analysis. New York: John Wiley & Sons.
Gaugler, T., Kim, D. and Liao, S. (2007). Comparing two survival time distributions: An investigation of several weight functions for the weighted logrank statistic. Communications in Statistics -- Simulation and Computation 36(2), 423--435.
Gehan, E. A. (1965). A generalized Wilcoxon test for comparing arbitrarily single-censored samples. Biometrika 52(1--2), 203--223.
Hothorn, T. and Lausen, B. (2003). On the exact distribution of maximally selected rank statistics. Computational Statistics & Data Analysis 43(2), 121--137.
Klein, J. P. and Moeschberger, M. L. (2003). Survival Analysis: Techniques for Censored and Truncated Data, Second Edition. New York: Springer.
Lee, J. W. (1996). Some versatile tests based on the simultaneous use of weighted log-rank statistics. Biometrics 52(2), 721--725.
Let<U+00F3>n, E. and Zuluaga, P. (2001). Equivalence between scores and weighted tests for survival curves. Communications in Statistics -- Theory and Methods 30(4), 591--608.
Mantel, N. (1966). Evaluation of survival data and two new rank order statistics arising in its consideration. Cancer Chemotherapy Reports 50(3), 163--170.
Peto, R. and Peto, J. (1972). Asymptotic efficient rank invariant test procedures (with discussion). Journal of the Royal Statistical Society A 135(2), 185--207.
Prentice, R. L. (1978). Linear rank tests with right censored data. Biometrika 65(1), 167--179.
Prentice, R. L. and Marek, P. (1979). A qualitative discrepancy between censored data rank tests. Biometrics 35(4), 861--867.
Self, S. G. (1991). An adaptive weighted log-rank test with application to cancer prevention and screening trials. Biometrics 47(3), 975--986.
Tarone, R. E. (1975). Tests for trend in life table analysis. Biometrika 62(3), 679--682.
Tarone, R. E. and Ware, J. (1977). On distribution-free tests for equality of survival distributions. Biometrika 64(1), 156--160.
# NOT RUN {
## Example data (Callaert, 2003, Tab.1)
callaert <- data.frame(
time = c(1, 1, 5, 6, 6, 6, 6, 2, 2, 2, 3, 4, 4, 5, 5),
group = factor(rep(0:1, c(7, 8)))
)
## Logrank scores using mid-ranks (Callaert, 2003, Tab.2)
with(callaert,
logrank_trafo(Surv(time)))
## Asymptotic Mantel-Cox test (p = 0.0523)
survdiff(Surv(time) ~ group, data = callaert)
## Exact logrank test using mid-ranks (p = 0.0505)
logrank_test(Surv(time) ~ group, data = callaert, distribution = "exact")
## Exact logrank test using average-scores (p = 0.0468)
logrank_test(Surv(time) ~ group, data = callaert, distribution = "exact",
ties.method = "average-scores")
## Lung cancer data (StatXact 9 manual, p. 213, Tab. 7.19)
lungcancer <- data.frame(
time = c(257, 476, 355, 1779, 355,
191, 563, 242, 285, 16, 16, 16, 257, 16),
event = c(0, 0, 1, 1, 0,
1, 1, 1, 1, 1, 1, 1, 1, 1),
group = factor(rep(1:2, c(5, 9)),
labels = c("newdrug", "control"))
)
## Logrank scores using average-scores (StatXact 9 manual, p. 214)
with(lungcancer,
logrank_trafo(Surv(time, event), ties.method = "average-scores"))
## Exact logrank test using average-scores (StatXact 9 manual, p. 215)
logrank_test(Surv(time, event) ~ group, data = lungcancer,
distribution = "exact", ties.method = "average-scores")
## Exact Prentice test using average-scores (StatXact 9 manual, p. 222)
logrank_test(Surv(time, event) ~ group, data = lungcancer,
distribution = "exact", ties.method = "average-scores",
type = "Prentice")
## Approximative (Monte Carlo) versatile test (Lee, 1996)
rho.gamma <- expand.grid(rho = seq(0, 2, 1), gamma = seq(0, 2, 1))
lee_trafo <- function(y)
logrank_trafo(y, ties.method = "average-scores",
type = "Fleming-Harrington",
rho = rho.gamma["rho"], gamma = rho.gamma["gamma"])
it <- independence_test(Surv(time, event) ~ group, data = lungcancer,
distribution = approximate(B = 10000),
ytrafo = function(data)
trafo(data, surv_trafo = lee_trafo))
pvalue(it, method = "step-down")
# }
Run the code above in your browser using DataLab