h2o.SpeeDRF(x, y, data, key = "", classification = TRUE, nfolds = 0, validation,
holdout.fraction = 0, mtries = -1, ntree = 50, depth = 20, sample.rate = 2/3,
oobee = TRUE, importance = FALSE, nbins = 1024, seed = -1,
stat.type = "ENTROPY", balance.classes = FALSE, verbose = FALSE)
H2OParsedData
object containing the variables in the model.nfolds >= 2
, then validation
must remain empty.H2OParsedData
object indicating the validation dataset used to construct confusion matrix. If left blank, this defaults to the training data when nfolds = 0
.sqrt(ncol(data))
, rounded down to the nearest integer.TRUE
, the algorithm will take longer to finish.)seed = -1
, one will automatically be generated by H2O.H2OSpeeDRFModel
with slots key, data, valid (the validation dataset), and model, where the last is a list of the following components:beta = TRUE
in h2o.init
in order to use this method!This method runs random forest model building on a single node, as opposed to the multi-node implementation in h2o.randomForest
.
H2OSpeeDRFModel
, h2o.randomForest
library(h2o)
localH2O = h2o.init()
irisPath = system.file("extdata", "iris.csv", package = "h2o")
iris.hex = h2o.importFile(localH2O, path = irisPath, key = "iris.hex")
h2o.SpeeDRF(x = c(2,3,4), y = 5, data = iris.hex, ntree = 50, depth = 100)
Run the code above in your browser using DataLab