Data containing two sources of bias, three known confounders, and
100,000 observations. This data is obtained from df_uc_em_source
by removing the columns X and U. The resulting data
corresponds to what a researcher would see in the real-world: a
misclassified exposure, Xstar, and missing data on a confounder
U. As seen in df_uc_em_source
, the true, unbiased
exposure-outcome odds ratio = 2.
df_uc_em
A dataframe with 100,000 rows and 5 columns:
misclassified exposure, 1 = present and 0 = absent
outcome, 1 = present and 0 = absent
1st confounder, 1 = present and 0 = absent
2nd confounder, 1 = present and 0 = absent
3rd confounder, 1 = present and 0 = absent