Data containing one source of bias (outcome misclassification), three known confounders, and
100,000 observations. This data is obtained from df_om_source
by removing the column Y. The resulting data corresponds to
what a researcher would see in the real world: a misclassified outcome,
Ystar, and no data on the true outcome. As seen in
df_om_source, the true, unbiased exposure-outcome odds ratio is 2.
df_om
A dataframe with 100,000 rows and 5 columns:
exposure, 1 = present and 0 = absent
misclassified outcome, 1 = present and 0 = absent
1st confounder, 1 = present and 0 = absent
2nd confounder, 1 = present and 0 = absent
3rd confounder, 1 = present and 0 = absent
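
The derivation described above (dropping the true outcome Y from the source data) can be sketched in base R. This is only an illustration on a small toy data frame, not the package's own construction code: `toy_source` and `toy_om` are hypothetical names, the values are random, and the column names other than Y and Ystar (X, C1, C2, C3) are assumptions.

```r
# Toy stand-in for df_om_source (hypothetical; values are random).
# Column names X, C1, C2, C3 are assumed for illustration only.
set.seed(1)
n <- 10
toy_source <- data.frame(
  X     = rbinom(n, 1, 0.5),  # exposure
  Y     = rbinom(n, 1, 0.5),  # true outcome (not observed in practice)
  Ystar = rbinom(n, 1, 0.5),  # misclassified outcome
  C1    = rbinom(n, 1, 0.5),  # 1st confounder
  C2    = rbinom(n, 1, 0.5),  # 2nd confounder
  C3    = rbinom(n, 1, 0.5)   # 3rd confounder
)

# Drop the true outcome Y, keeping only what a researcher would observe.
toy_om <- toy_source[, setdiff(names(toy_source), "Y")]

names(toy_om)
ncol(toy_om)
```

Applied to the real source data, the same step would leave the 5 columns listed above and all 100,000 rows.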