Artificial data that can be used for unit-testing or teaching (fairness & AI bias)
create_data_unfair(
obs = 1000,
target_name = "target_ind",
factorise_target = FALSE,
target1_prob = 0.25,
add_id = FALSE,
seed = 123
)
A dataframe
Number of observations
Variable name of target
Should target variable be factorised?
Probability that target = 1
Add an id-variable to data?
Seed for randomization (integer)