Functions to get "learner" functions for gpe
.
gpe_trees(..., remove_duplicates_complements = TRUE, mtry = Inf,
ntrees = 500, maxdepth = 3L, learnrate = 0.01, parallel = FALSE,
use_grad = TRUE, tree.control = ctree_control(mtry = mtry, maxdepth =
maxdepth))gpe_linear(..., winsfrac = 0.025, normalize = TRUE)
gpe_earth(..., degree = 3, nk = 8, normalize = TRUE, ntrain = 100,
learnrate = 0.1, cor_thresh = 0.99)
Currently not used.
TRUE
. Should rules with complementary or duplicate support be removed?
Number of input variables randomly sampled as candidates at each node for random forest like algorithms. The argument is passed to the tree methods in the partykit
package.
Number of trees to fit. Will not have an effect if tree.control
is used.
Maximum depth of trees. Will not have an effect if tree.control
is used.
Learning rate for methods. Corresponds to the \(\nu\) parameter in Friedman & Popescu (2008).
TRUE
. Should basis functions be found in parallel?
ctree_control
with options for the ctree
function.
Quantile to winsorize linear terms. The value should be in \([0,0.5)\)
TRUE
. Should value be scaled by .4 times the inverse standard deviation? If TRUE
, gives linear terms the same influence as a typical rule.
Maximum degree of interactions in earth
model.
Maximum number of basis functions in earth
model.
Number of models to fit.
A threshold on the pairwise correlation for removal of basis functions. This is similar to remove_duplicates_complements
. One of the basis functions in pairs where the correlation exceeds the threshold is excluded. NULL
implies no exclusion. Setting a value closer to zero will decrease the time needed to fit the final model.
A function that has formal arguments formula, data, weights, sample_func, verbose, family, ...
. The function returns a vector with character where each element is a term for the final formula in the call to cv.glmnet
gpe_trees
provides learners for tree method. Either ctree
or glmtree
from the partykit
package will be used.
gpe_linear
provides linear terms for the gpe
.
gpe_earth
provides basis functions where each factor is a hinge function. The model is estimated with earth
.
Hothorn, T., & Zeileis, A. (2015). partykit: A modular toolkit for recursive partytioning in R. Journal of Machine Learning Research, 16, 3905-3909.
Friedman, J. H. (1991). Multivariate adaptive regression splines. The Annals Statistics, 19(1), 1-67.
Friedman, J. H. (2001). Greedy function approximation: a gradient boosting machine. The Annals of Applied Statistics, 29(5), 1189-1232.
Friedman, J. H. (1993). Fast MARS. Dept. of Statistics Technical Report No. 110, Stanford University.
Friedman, J. H., & Popescu, B. E. (2008). Predictive learning via rule ensembles. The Annals of Applied Statistics, 2(3), 916-954.
Chen T., & Guestrin C. (2016). Xgboost: A scalable tree boosting system. Proceedings of the 22Nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2016.