rsample::bootstraps()Used by bootstrap_performance().
calc_perf_bootstrap_split(
test_data_split,
trained_model,
outcome_colname,
perf_metric_function,
perf_metric_name,
class_probs,
method,
seed
)a long data frame of performance metrics for rsample::int_pctl()
a single bootstrap of the test set from rsample::bootstraps()
Trained model from caret::train().
Column name as a string of the outcome variable
(default NULL; the first column will be chosen automatically).
Function to calculate the performance metric to
be used for cross-validation and test performance. Some functions are
provided by caret (see caret::defaultSummary()). Defaults: binary
classification = twoClassSummary, multi-class classification =
multiClassSummary, regression = defaultSummary.
The column name from the output of the function
provided to perf_metric_function that is to be used as the performance
metric. Defaults: binary classification = "ROC", multi-class
classification = "logLoss", regression = "RMSE".
Whether to use class probabilities (TRUE for categorical outcomes, FALSE for numeric outcomes).
ML method. Options: c("glmnet", "rf", "rpart2", "svmRadial", "xgbTree").
glmnet: linear, logistic, or multiclass regression
rf: random forest
rpart2: decision tree
svmRadial: support vector machine
xgbTree: xgboost
Random seed (default: NA). Your results will only be
reproducible if you set a seed.
Kelly Sovacool, sovacool@umich.edu