A dataset containing age and cause of death, as well as age at disease diagnosis (or start of a condition) for 100,000 simulated persons.
simu_data
A data frame with 100,000 rows and 6 variables:
unique identifier of each person
age at start of follow-up (0 for all individuals)
age at end of follow-up (death or censoring)
logical variable (TRUE = death / FALSE = censoring)
factor variable with 3 levels: "Alive" (for those censored), and "Natural" and "Unnatural" (for those dying of natural and unnatural causes of death, respectively)
age at onset of a specific disease or condition for the 32,391 individuals who develop it (missing for the remaining 67,609)
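
The structure described above can be sketched with a small toy data frame. This is only an illustration: the column names (`id`, `age_start`, `age_end`, `death`, `cause_death`, `age_disease`) are hypothetical stand-ins, not necessarily the names used in `simu_data`, and the four rows are made-up values rather than records from the dataset.

```r
# Toy data frame mirroring the six documented variables.
# Column names are hypothetical; values are invented for illustration only.
toy <- data.frame(
  id          = 1:4,                          # unique identifier
  age_start   = c(0, 0, 0, 0),                # follow-up starts at age 0 for everyone
  age_end     = c(81.2, 64.7, 90.0, 45.3),    # age at death or censoring
  death       = c(TRUE, TRUE, FALSE, TRUE),   # TRUE = death, FALSE = censoring
  cause_death = factor(c("Natural", "Unnatural", "Alive", "Natural"),
                       levels = c("Alive", "Natural", "Unnatural")),
  age_disease = c(60.5, NA, 72.1, NA)         # NA = never develops the disease
)

# Censored individuals should be exactly those with cause "Alive".
consistent_cause <- identical(toy$cause_death == "Alive", !toy$death)

# Number of individuals who develop the disease (non-missing onset age).
n_with_disease <- sum(!is.na(toy$age_disease))
```

The same two checks, applied to the real data with its actual column names, would be expected to confirm that censoring and the "Alive" level coincide and to give 32,391 individuals with a non-missing disease age.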