Learn R Programming

daltoolboxdp (version 1.2.737)

bal_subsampling: Subsampling

Description

Subsampling balances class distributions by reducing the representation of majority classes through random under-sampling.

Usage

bal_subsampling(attribute)

Value

A bal_subsampling object.

Arguments

attribute

Character. Name of the target class attribute to balance.

References

Kubat, M., & Matwin, S. (1997). Addressing the Curse of Imbalanced Training Sets: One-Sided Selection. Drummond, C., & Holte, R. (2003). C4.5, Class Imbalance, and Cost Sensitivity.

Examples

Run this code
if (FALSE) {
set.seed(123)
data(iris)
mod_iris <- iris[c(1:50, 51:71, 101:111), ]   # induce imbalance
table(mod_iris$Species)

bal <- bal_subsampling('Species')              # random under-sampling
bal <- daltoolbox::fit(bal, mod_iris)
adjust_iris <- daltoolbox::transform(bal, mod_iris)
table(adjust_iris$Species)                     # all classes at minority count
}

Run the code above in your browser using DataLab