Merges factor levels that occur only infrequently into combined levels with a higher frequency.
mergeSmallFactorLevels(task, cols = NULL, min.perc = 0.01,
  new.level = ".merged")
task (Task)
The task.
cols (character)
Columns whose factor levels should be merged.
Default is all factor and character columns.
min.perc (numeric(1))
The smallest levels of a factor are merged until their combined proportion
relative to the length of the factor exceeds min.perc.
Must be between 0 and 1.
Default is 0.01.
new.level (character(1))
Name of the new, merged level.
Default is ".merged".
Task in which the infrequent levels are combined into a single new level named new.level.
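A minimal usage sketch, assuming the mlr package is installed. The data frame, its column names (x, y), and the threshold of 0.05 are made up for illustration; makeClassifTask and getTaskData are used here only to build a task and inspect the result.

```r
library(mlr)

# Toy data (hypothetical): factor x has two common levels (a, b)
# and two rare ones (c, d), each making up 1% of the rows.
df <- data.frame(
  x = factor(c(rep("a", 50), rep("b", 48), "c", "d")),
  y = factor(rep(c("pos", "neg"), times = 50))
)

# Task built only for this illustration.
task <- makeClassifTask(data = df, target = "y")

# Merge the rare levels of x; 0.05 is an arbitrary demo threshold.
merged.task <- mergeSmallFactorLevels(task, cols = "x", min.perc = 0.05)

# Compare the level counts of x before and after merging.
table(getTaskData(task)$x)
table(getTaskData(merged.task)$x)
```

Passing cols = "x" restricts the merge to a single column. Leaving cols at its default of NULL would consider all factor and character columns, as described for the cols argument.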
Other eda_and_preprocess: capLargeValues,
createDummyFeatures,
dropFeatures,
normalizeFeatures,
removeConstantFeatures,
summarizeColumns,
summarizeLevels