This function creates new DMUs to address data imbalances. If the majority class is efficient, it generates new inefficient DMUs by worsering the observed units. Conversely, if the majority class is inefficient, it projects inefficient DMUs to the frontier. Finally, a random selection if performed to keep a proportion of 0.65 for the majority class and 0.35 for the minority class.
SMOTE_data(data, x, y, RTS = "vrs", balance_data, seed)It returns a data.frame with the newly created set of DMUs incorporated.
A data.frame containing the variables used in the model.
Column indexes of the input variables in the data.
Column indexes of the output variables in the data.
Text string or number defining the underlying DEA technology /
returns-to-scale assumption (default: "vrs"). Accepted values:
0 / "fdh"Free disposability hull, no convexity assumption.
1 / "vrs"Variable returns to scale, convexity and free disposability.
2 / "drs"Decreasing returns to scale, convexity, down-scaling and free disposability.
3 / "crs"Constant returns to scale, convexity and free disposability.
4 / "irs"Increasing returns to scale (up-scaling, not down-scaling), convexity and free disposability.
5 / "add"Additivity (scaling up and down, but only with integers), and free disposability.
Indicate level of efficient units to achive and the number of efficient and not efficient units.
Integer. Seed for reproducibility.