Note: a newer version (1.0.2.1) of this package is available.

imbalance

imbalance provides a set of tools to work with imbalanced datasets: novel oversampling algorithms, filtering of instances and evaluation of synthetic instances.

Installation

You can install imbalance from GitHub with:

# install.packages("devtools")
devtools::install_github("ncordon/imbalance")

Examples

Run the pdfos algorithm on the newthyroid1 imbalanced dataset and plot a comparison between attributes.

library("imbalance")
data(newthyroid1)

newSamples <- pdfos(newthyroid1, numInstances = 80)
# Join new samples with old imbalanced dataset
newDataset <- rbind(newthyroid1, newSamples)
# Plot a visual comparison between both datasets
plotComparison(newthyroid1, newDataset, attrs = names(newthyroid1)[1:3], cols = 2, classAttr = "Class")
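
To put a number on how much the oversampling step changed the class balance, one option is to tabulate the class column before and after joining the new samples. The following is only a minimal base R sketch: `classBalance` is a hypothetical helper (it is not part of imbalance), and the toy data frames merely stand in for newthyroid1 and newDataset, assuming the class column is called "Class" as in the plotComparison call above.

```r
# Hypothetical helper (not part of the imbalance package): summarise the
# class distribution of a data frame whose class column is `classAttr`.
classBalance <- function(dataset, classAttr = "Class") {
  counts <- table(dataset[[classAttr]])
  list(
    counts = counts,
    # Ratio of smallest to largest class size; 1 means perfectly balanced
    ratio = min(counts) / max(counts)
  )
}

# Toy stand-in for an imbalanced dataset: 90 negative vs 10 positive rows
original <- data.frame(
  x = seq_len(100),
  Class = factor(c(rep("negative", 90), rep("positive", 10)))
)

# Toy stand-in for 80 synthetic minority rows, as an oversampler might return
synthetic <- data.frame(
  x = seq_len(80) + 100,
  Class = factor(rep("positive", 80), levels = levels(original$Class))
)

# Join new samples with the original data, as in the example above
balanced <- rbind(original, synthetic)

before <- classBalance(original)
after <- classBalance(balanced)
```

With the package loaded, calling classBalance(newthyroid1) and classBalance(newDataset) should give the same kind of summary for the real data.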

After filtering examples with neater:

filteredSamples <- neater(newthyroid1, newSamples, iterations = 500)
#> [1] "15 samples filtered by NEATER"
filteredNewDataset <- rbind(newthyroid1, filteredSamples)
plotComparison(newthyroid1, filteredNewDataset, attrs = names(newthyroid1)[1:3])

Install

install.packages('imbalance')

Monthly Downloads

8,752

Version

0.1.1

License

GPL (>= 2) | file LICENSE

Maintainer

Ignacio Cordón

Last Published

November 15th, 2017

Functions in imbalance (0.1.1)

pdfos

Probability density function estimation based oversampling

haberman

Haberman's survival data

plotComparison

Plots comparison between the original and the new balanced dataset.

ecoli1

Imbalanced binary ecoli protein localization sites

neater

Filtering of oversampled data based on non-cooperative game theory

newthyroid1

Imbalanced binary thyroid gland data

glass0

Imbalanced binary glass identification

iris0

Imbalanced binary iris dataset

wracog

Wrapper for rapidly converging Gibbs algorithm.

mwmote

Majority weighted minority oversampling technique for imbalanced dataset learning

yeast4

Imbalanced binary yeast protein localization sites

imbalance

imbalance: A package to treat imbalanced datasets

trainWrapper

Generic methods to train classifiers

wisconsin

Imbalanced binary breast cancer Wisconsin dataset

racog

Rapidly converging Gibbs algorithm.

rwo

Random walk oversampling
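
Since several oversamplers are listed, it can be handy to wrap the "generate, then rbind" step from the Examples section so that algorithms can be swapped easily. The sketch below is an assumption-laden illustration in base R: `augmentWith` and `duplicateMinority` are hypothetical names that are not part of imbalance, and the stand-in oversampler simply duplicates minority rows so that the block runs without the package installed.

```r
# Hypothetical helper (not part of the imbalance package): run any
# oversampling function and append its synthetic rows to the dataset.
# Assumes `oversampler` accepts (dataset, numInstances = ...) and returns a
# data frame with the same columns as `dataset`.
augmentWith <- function(oversampler, dataset, numInstances, ...) {
  synthetic <- oversampler(dataset, numInstances = numInstances, ...)
  stopifnot(identical(names(synthetic), names(dataset)))
  rbind(dataset, synthetic)
}

# Stand-in oversampler for demonstration only: duplicates randomly chosen
# minority rows. It is NOT one of the package's algorithms.
duplicateMinority <- function(dataset, numInstances, classAttr = "Class") {
  counts <- table(dataset[[classAttr]])
  minorityLabel <- names(counts)[which.min(counts)]
  minority <- dataset[dataset[[classAttr]] == minorityLabel, , drop = FALSE]
  picked <- sample(nrow(minority), numInstances, replace = TRUE)
  minority[picked, , drop = FALSE]
}

set.seed(1)
# Toy imbalanced data: 40 negative vs 10 positive rows
toy <- data.frame(
  x = rnorm(50),
  Class = factor(c(rep("negative", 40), rep("positive", 10)))
)

augmented <- augmentWith(duplicateMinority, toy, numInstances = 30)
table(augmented$Class)
```

With imbalance loaded, a call such as augmentWith(pdfos, newthyroid1, numInstances = 80) should reproduce the newDataset built in the Examples section. Passing another listed oversampler in place of pdfos assumes that it follows the same dataset/numInstances calling pattern, which is worth checking against that function's help page first.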