Usage
select_RF(X, y, drop_fraction, number_selected, mtry_factor, ntree_factor,
min_ntree, num_processors, nodesize)
Arguments
X
A data.frame.
Each column corresponds to a feature vectors.
Could include additional covariates not a part of
the original modules.
drop_fraction
A number between 0 and 1. Percentage of features
dropped at each iteration.
number_selected
Number of features selected by fuzzyforest.
mtry_factor
In the case of regression, mtry is set to
ceiling($\sqrt(p)$*mtry_factor).
In the case of classification, mtry is set to
ceiling((p/3)*mtry_factor). If either
of these numbers
ntree_factor
A number greater than 1. ntree for each
random is ntree_factor times the number
of features. For each random forest, ntree
is set to max(min_ntree,
ntree_factor*p
min_ntree
Minimum number of trees grown in each random forest.
num_processors
Number of processors used to fit random forests.