Generated data set having a binary or 4-level Y variable and
a two-dimensional X (first two levels resemble bananas).
Both the train and the test set have 2000 samples in the binary case,
and 4000 in the multi-class case.
They were generated by the authors and their collaborators.