unbalanced (version 2.0)

ubOver: Over-sampling

Description

The function replicates randomly some instances from the minority class in order to obtain a final dataset with the same number of instances from the two classes.

Usage

ubOver(X, Y, k = 0, verbose=TRUE)

Arguments

X
the input variables of the unbalanced dataset.
Y
the response variable of the unbalanced dataset. It must be a binary factor where the majority class is coded as 0 and the minority as 1.
k
defines the sampling method.
verbose
print extra information (TRUE/FALSE)

Value

The function returns a list:
X
input variables
Y
response variable

Details

If K=0: sample with replacement from the minority class until we have the same number of instances in each class. If K>0: sample with replacement from the minority class until we have k-times the orginal number of minority instances.

See Also

ubBalance

Examples

Run this code
library(unbalanced)
data(ubIonosphere)
n<-ncol(ubIonosphere)
output<-ubIonosphere$Class
input<-ubIonosphere[ ,-n]

data<-ubOver(X=input, Y= output)
newData<-cbind(data$X, data$Y)

Run the code above in your browser using DataLab