A dataset containing age and cause of death, as well as age at disease diagnosis (or start of a condition) for 100,000 simulated persons.
simu_data
A data frame with 100000 rows and 6 variables:
unique identifier of each person
age at start of follow-up (0 for all individuals)
age at end of follow-up (death or censoring)
logical variable (TRUE
= death / FALSE
= censoring)
factor variable with 3 levels: "Alive"
(for those
censored) and "Natural"
and "Unnatural"
(for those dying of natural
and unnatural causes of death, respectively)
age at developing a specific disease or condition for those 32,391 individuals that develop the disease (missing for the remaining 67,609)