This function implements split conformal prediction intervals for RFs. Currently used in rfint().
CQRF(
formula = NULL,
train_data = NULL,
pred_data = NULL,
num_trees = NULL,
min_node_size = NULL,
m_try = NULL,
keep_inbag = TRUE,
intervals = TRUE,
alpha = NULL,
forest_type = "RF",
num_threads = NULL,
interval_type = NULL
)
Object of class formula or character describing the model to fit. Interaction terms supported only for numerical variables.
Training data of class data.frame, matrix, dgCMatrix (Matrix) or gwaa.data (GenABEL). Matches ranger() requirements.
Test data of class data.frame, matrix, dgCMatrix (Matrix) or gwaa.data (GenABEL). Utilizes ranger::predict() to get prediction intervals for test data.
Number of trees.
Minimum number of observations before split at a node.
Number of variables to randomly select from at each split.
Saves matrix of observations and which tree(s) they occur in. Required to be true to generate variance estimates for Ghosal, Hooker 2018 method. *Should not be an option...
Generate prediction intervals or not.
Significance level for prediction intervals.
Determines what type of forest: regression forest vs. quantile regression forest. *Should not be an option...
The number of threads to use in parallel. Default is the current number of cores.
Type of prediction interval to generate.
Options are method = c("two-sided", "lower", "upper")
. Default is method = "two-sided"
.