Learn R Programming

AdaSampling

An R implementation of the AdaSampling algorithm for positive unlabeled and label noise learning

Description

Implements the AdaSampling procedure, a framework for both positive unlabeled learning and learning with class label noise, which wraps around a traditional classifying algorithm. See our publication for details, documentation and examples.

Installation

There are two ways to install the package:

To install from CRAN [https://CRAN.R-project.org/package=AdaSampling]:

install.packages("AdaSampling")

To install from github, use:

devtools::install_github("PengyiYang/AdaSampling", build_vignettes = TRUE)
library(AdaSampling)

Current version of this package includes two functions:

  • adaSample() applies the AdaSampling procedure to reduce noise in the training set,

and subsequently trains a classifier from the new training set.

  • adaSvmBenchmark() which allows the performance of the AdaSampling procedure (with an SVM

classifier) to be compared against the performance of the SVM classifier on its own.

In order to see demonstrations of these two functions, see:

browseVignettes("AdaSampling")

References

  • Yang, P., Ormerod, J., Liu, W., Ma, C., Zomaya, A., Yang, J.(2018) AdaSampling for positive-unlabeled and label noise learning with bioinformatics applications. IEEE Transactions on Cybernetics, [doi:10.1109/TCYB.2018.2816984]

  • Yang, P., Liu, W., Yang, J. (2017). Positive unlabeled learning via wrapper-based adaptive

sampling. Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI), 3273-3279. [fulltext]

Acknowledgement

The initial github repo of the AdaSampling package was put together by Kukulege Dinuka Perera.

Copy Link

Version

Install

install.packages('AdaSampling')

Monthly Downloads

417

Version

1.3

License

GPL-3

Issues

Pull Requests

Stars

Forks

Maintainer

Pengyi Yang

Last Published

May 21st, 2019

Functions in AdaSampling (1.3)

weightedKNN

Implementation of a feature weighted k-nearest neighbour classifier.
brca

Wisconsin Breast Cancer Database (1991)
adaSample

Implementation of AdaSampling for positive unlabelled and label noise learning.
adaSvmBenchmark

Benchmarking AdaSampling efficacy on noisy labelled data.
singleIter

singleIter() applies a single iteraction of AdaSampling procedure. It returns the probabilities of all samples as being a positive (P) or negative (N) instance, as a two column data frame.