Usage
create_and_run_test(train_index, data, dependent, problem = c("classification", "regression"), method = c("randomForest", "rpart"), name, description, data_transform = identity, data_transform_name = "identity")
Arguments
train_index
A vector containing the rows from data
to be used as the training
dependent
The dependent variable: the name of the column containing the prediction goal
problem
Either classification or regression. This influences how the algorithms are trained and what method is used to determine performance
method
The regression or classification method
name
The name of the test. Printed in the test results
description
Optional. A more elaborate description of the test
data_transform
A quoted function name that transforms the data. It should maintain it in data frame form and maintain the dependent variable.
data_transform_name
The name of the data transformation function