By which statistic ("pd", "pred_mean", "y_mean", "resid_mean", "ale")
should the results be sorted? The default is "no" (no sorting). Calculated
after all other update steps, e.g., after collapsing or dropping rare levels.
to_factor
Should discrete features be treated as factors?
In combination with collapse_m, this can be used to collapse rare values of
discrete numeric features.
collapse_m
If a factor or character feature has more than collapse_m levels,
rare levels are collapsed into a new level "other". Standard deviations are
collapsed via root of the weighted average variances. The default is 30.
Set to Inf for no collapsing.
collapse_by
How to determine "rare" levels in collapse_m?
Either "weight" (default) or "N". Only matters in situations with case weights w.
drop_empty
Drop empty bins. Equivalent to drop_below_n = 1.
drop_below_n
Drop bins with N below this value. Applied after collapsing.
drop_below_weight
Drop bins with weight below this value. Applied after
collapsing.
na.rm
Should missing bin centers be dropped? Default is FALSE.