Artificial data that can be used for unit-testing or teaching
create_data_churn(
obs = 1000,
target_name = "churn",
factorise_target = FALSE,
target1_prob = 0.4,
add_id = FALSE,
seed = 123
)
A dataframe
Number of observations
Variable name of target
Should target variable be factorised?
Probability that target = 1
Add an id-variable to data?
Seed for randomization (integer)