Random data that can be used for unit-testing or teaching
create_data_random(
obs = 1000,
vars = 10,
target_name = "target_ind",
factorise_target = FALSE,
target1_prob = 0.5,
add_id = TRUE,
seed = 123
)
A dataframe
Number of observations
Number of variables
Variable name of target
Should target variable be factorised? (from 0/1 to facotr no/yes)?
Probability that buy = 1
Add an id-variable to data?
Seed for randomization
Variables in dataset:
id = Identifier
var_X = variable containing values between 0 and 100
Target in dataset:
target_ind (may be renamed) = random values (1 = yes, 0 = no)