Merges factor levels that occur only infrequently into combined levels with a higher frequency.
mergeSmallFactorLevels(
  task,
  cols = NULL,
  min.perc = 0.01,
  new.level = ".merged"
)

Task, where merged levels are combined into a new level named new.level.
task (Task)
The task.
cols (character)
Which columns to convert. Default is all factor and character columns.
min.perc (numeric(1))
The smallest levels of a factor are merged until their combined proportion
relative to the length of the factor exceeds min.perc.
Must be between 0 and 1.
Default is 0.01.
new.level (character(1))
New name of merged level.
Default is ".merged".
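
The merging rule described for min.perc can be sketched in base R. This is a minimal sketch of one literal reading of that description, not mlr's actual code: the helper merge_small_levels is hypothetical, and the choice to leave the factor untouched when only a single level would be affected is an assumption. Consult the package source for the exact behaviour.

```r
# Hypothetical helper illustrating the documented rule; not part of mlr.
merge_small_levels <- function(x, min.perc = 0.01, new.level = ".merged") {
  x <- as.factor(x)
  stopifnot(min.perc >= 0, min.perc <= 1, !(new.level %in% levels(x)))
  # Proportion of each level, sorted from smallest to largest.
  props <- sort(prop.table(table(x)))
  # Position of the first level at which the running total exceeds min.perc.
  k <- which(cumsum(props) > min.perc)[1]
  # Assumption: merging a single level would only rename it, so do nothing
  # unless at least two levels are involved.
  if (is.na(k) || k < 2) return(x)
  small <- names(props)[seq_len(k)]
  # Assigning the same label to several levels collapses them into one.
  levels(x)[levels(x) %in% small] <- new.level
  x
}

# Toy factor: "a" dominates, "b" and "c" each make up 6% of the values.
x <- factor(c(rep("a", 88), rep("b", 6), rep("c", 6)))
merged <- merge_small_levels(x, min.perc = 0.1)
table(merged)
```

In this toy input the two rare levels together account for 12% of the values, so they are collapsed into the single level ".merged" while "a" is left unchanged.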
Other eda_and_preprocess:
capLargeValues(),
createDummyFeatures(),
dropFeatures(),
normalizeFeatures(),
removeConstantFeatures(),
summarizeColumns(),
summarizeLevels()
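
A short usage sketch may help show how the function fits into a workflow. The toy data frame, the column names x and y, and the 10% threshold are made up for illustration; only makeClassifTask(), mergeSmallFactorLevels(), and getTaskData() come from mlr.

```r
library(mlr)

# Toy data (illustrative only): level "a" dominates, "b" and "c" are rare.
df <- data.frame(
  x = factor(c(rep("a", 88), rep("b", 6), rep("c", 6))),
  y = factor(rep(c("pos", "neg"), 50))
)
task <- makeClassifTask(data = df, target = "y")

# Merge the rare levels of column "x" using a 10% threshold.
merged.task <- mergeSmallFactorLevels(task, cols = "x", min.perc = 0.1)

# Compare the level counts before and after merging.
table(getTaskData(task)$x)
table(getTaskData(merged.task)$x)
```

The original task is not modified; the function returns a new task, so both versions can be compared side by side.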