A cforest
is a random forest based on conditional inference
trees, using the implementation in the party package.
These trees can be used for classification, regression or survival
analysis, but only the survival part has been properly tested so far.
fit_cforest(x, y, formula = y ~ ., ctrl_fun = party::cforest_unbiased, ...)
Dataset, observations as rows and descriptors as columns.
Responses.
Formula linking response to descriptors.
Which control function to use, see cforest_control
.
Sent to the function specified by ctrl_fun
.
A fitted cforest
model.
The parameters to cforest
are set using a
cforest_control
object. You should read the documentation
as the default values are chosen for technical reasons, not predictive
performance!
Pay special attention to mtry
which is set very low by default.
Torsten Hothorn, Peter Buehlmann, Sandrine Dudoit, Annette Molinaro and Mark Van Der Laan (2006). Survival Ensembles. Biostatistics, 7(3), 355--373.
Carolin Strobl, Anne-Laure Boulesteix, Achim Zeileis and Torsten Hothorn (2007). Bias in Random Forest Variable Importance Measures: Illustrations, Sources and a Solution. BMC Bioinformatics, 8(25). URL http://www.biomedcentral.com/1471-2105/8/25.
Carolin Strobl, Anne-Laure Boulesteix, Thomas Kneib, Thomas Augustin and Achim Zeileis (2008). Conditional Variable Importance for Random Forests. BMC Bioinformatics, 9(307). URL http://www.biomedcentral.com/1471-2105/9/307.