This dataset, Stroke_df, contains fictional case-control data for ischemic stroke, including exposures, risk factors, and confounders. The dataset includes 16,623 observations and 21 variables, covering demographic details, lifestyle factors, biomarkers, and comorbidities. Some observations contain missing values.
data(Stroke_df)A data frame with 16,623 observations and 21 variables:
Geographic region (factor)
Case indicator for ischemic stroke (numeric)
Sex of the participant (integer)
Age of the participant (integer)
Hypertension or blood pressure measure (numeric)
Smoking status (factor)
Perceived stress indicator (factor)
Waist-to-hip ratio tertiles (factor)
Physical activity indicator (factor)
Weekly alcohol consumption frequency (factor)
Diabetes / HbA1c category (factor)
Cardiac risk factor category (factor)
Alternative Healthy Eating Index tertiles (factor)
ApoB/ApoA ratio tertiles (factor)
Sub-education level (factor)
Mother’s education level (factor)
Father’s education level (factor)
Sub-hypertension indicator (factor)
Waist-to-hip ratio (numeric)
ApoB/ApoA continuous ratio (numeric)
Sample weights (numeric)
The dataset name has been kept as 'Stroke_df' to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the ForCausality package and assists users in identifying its specific characteristics. The suffix 'df' indicates that the dataset is a data frame. The original content has not been modified in any way.