SMOTE_data: Create New SMOTE Units to Balance Data combinations of m + s

Description

This function creates new DMUs to address data imbalances. If the majority class is efficient, it generates new inefficient DMUs by worsering the observed units. Conversely, if the majority class is inefficient, it projects inefficient DMUs to the frontier. Finally, a random selection if performed to keep a proportion of 0.65 for the majority class and 0.35 for the minority class.

Usage

SMOTE_data(data, x, y, RTS = "vrs", balance_data, seed)

Value

It returns a data.frame with the newly created set of DMUs incorporated.

Arguments

data

A data.frame containing the variables used in the model.

x

Column indexes of the input variables in the data.

y

Column indexes of the output variables in the data.

RTS

Text string or number defining the underlying DEA technology / returns-to-scale assumption (default: "vrs"). Accepted values:

0 / "fdh": Free disposability hull, no convexity assumption.

1 / "vrs"

Variable returns to scale, convexity and free disposability.

2 / "drs"

Decreasing returns to scale, convexity, down-scaling and free disposability.

3 / "crs"

Constant returns to scale, convexity and free disposability.

4 / "irs"

Increasing returns to scale (up-scaling, not down-scaling), convexity and free disposability.

5 / "add"

Additivity (scaling up and down, but only with integers), and free disposability.

balance_data

Indicate level of efficient units to achive and the number of efficient and not efficient units.

seed

Integer. Seed for reproducibility.