normalize
is used for every feature column.
Non numerical features will be left untouched and passed to the result.
For constant features most methods fail, special behaviour for this case is implemented.
normalizeFeatures(task, method = "standardize", exclude = character(0L), range = c(0, 1), on.constant = "quiet")
Task
]
The task.character(1)
]
Normalizing method. Available are:
center: Subtract mean.
scale: Divide by standard deviation.
standardize: Center and scale.
range: Scale to a given range.character
]
Names of the columns to exclude.
The target does not have to be included here.
Default is none.numeric(2)
]
Range for method range.
Default is c(0,1)
.character(1)
]
How should constant vectors be treated? Only used, of method != center,
since this methods does not fail for constant vectors. Possible actions are:
quiet: Depending on the method, treat them quietly:
scale: No division by standard deviation is done, input values.
will be returned untouched.
standardize: Only the mean is subtracted, no division is done.
range: All values are mapped to the mean of the given range.
warn: Same behaviour as quiet, but print a warning message.
stop: Stop with an error.Task
].
normalize
Other eda_and_preprocess: capLargeValues
,
createDummyFeatures
,
dropFeatures
,
mergeSmallFactorLevels
,
removeConstantFeatures
,
summarizeColumns