Split a data.frame into training and test sets.
data_split(
data = get_data("german"),
varname = "credit_risk",
p_test = 0.2,
p_quiz = 0.5
)
data.frame
string. output variable name
real. proportion of samples in the test set
real. proportion of samples from the test set in the quiz set
list with members
training set with output variable
test set without output variable
test set output variable
indices of quiz samples in the test set