rpart_train

formula

data

weights

A non-negative number for complexity parameter. Any split
that does not decrease the overall lack of fit by a factor of
<code>cp</code> is not attempted. For instance, with anova splitting,
this means that the overall R-squared must increase by <code>cp</code> at
each step. The main role of this parameter is to save computing
time by pruning off splits that are obviously not worthwhile.
Essentially, the user informs the program that any split which
does not improve the fit by <code>cp</code> will likely be pruned off by
cross-validation, and that hence the program need not pursue it.

An integer for the minimum number of observations
that must exist in a node in order for a split to be attempted.

minsplit

An integer for the maximum depth of any node
of the final tree, with the root node counted as depth 0.
Values greater than 30 <code>rpart</code> will give nonsense results on
32-bit machines. This function will truncate <code>maxdepth</code> to 30 in
those cases.

maxdepth

Other arguments to pass to either <code>rpart</code> or <code>rpart.control</code>.

<code>rpart_train</code> is a wrapper for <code>rpart()</code> tree-based models
where all of the model arguments are in the main function.

internal

A common interface is provided to allow users to specify a model without having to remember the different argument names across different functions or computational engines (e.g. 'R', 'Spark', 'Stan', etc).

rpart_train: Decision trees via rpart

Description

Usage

Arguments

Value