smote

Performs oversampling by creating new instances.

Methods for analysis of energy consumption data (electricity, gas,
water) at different data measurement intervals. The package provides feature extraction
methods and algorithms to prepare data for data mining and machine learning
applications. Deatiled descriptions of the methods and their application can be found
in Hopf (2019, ISBN:978-3-86309-669-4) "Predictive Analytics for Energy Efficiency and
Energy Retailing" <doi:10.20378/irbo-54833> and Hopf et al. (2016) <doi:10.1007/s12525-018-0290-9>
"Enhancing energy efficiency in the residential sector with smart meter data analytics".

Konstantin Hopf

SmartMeterAnalytics

Methods for Smart Meter Data Analysis

Andreas Weigert

Ilya Kozlovskiy

Thorsten Staake

smote function

<dl><dt>Variables</dt>
<dd>the <a href="/link/data.frame?package=SmartMeterAnalytics&version=1.1.1" data-mini-rdoc="SmartMeterAnalytics::data.frame">data.frame</a> of independent variables that should be used to create new instances</dd>
<dt>Classes</dt>
<dd>the class labels in the prediction problem</dd>
<dt>subset_use</dt>
<dd>a specific subset only is used for the oversampling. If <a href="/link/NULL?package=SmartMeterAnalytics&version=1.1.1" data-mini-rdoc="SmartMeterAnalytics::NULL">NULL</a>, everything is used.</dd>
<dt>k</dt>
<dd>the number of neigbours for generation</dd>
<dt>use_nearest</dt>
<dd>should only the nearest neighbours be used? (very slow)</dd>
<dt>proportions</dt>
<dd>to which proportion (of the biggest class) should the classes be equalized</dd>
<dt>equalise_with_undersampling</dt>
<dd>should additional undersampling be performed?</dd>
<dt>safe</dt>
<dd>should a safe version of SMOTE be used?</dd></dl>

Arguments

Ilya Kozlovskiy, Konstantin Hopf <a href="/link/konstantin.hopf%40uni-bamberg.de?package=SmartMeterAnalytics&version=1.1.1" data-mini-rdoc="SmartMeterAnalytics::konstantin.hopf@uni-bamberg.de">konstantin.hopf@uni-bamberg.de</a>

Author

Synthetic minority oversampling (SMOTE) — smote

<dl>

<dt>Variables</dt>
<dd>the <a href='https://rdrr.io/r/base/data.frame.html'>data.frame</a> of independent variables that should be used to create new instances</dd>


<dt>Classes</dt>
<dd>the class labels in the prediction problem</dd>


<dt>subset_use</dt>
<dd>a specific subset only is used for the oversampling. If <a href='https://rdrr.io/r/base/NULL.html'>NULL</a>, everything is used.</dd>


<dt>k</dt>
<dd>the number of neigbours for generation</dd>


<dt>use_nearest</dt>
<dd>should only the nearest neighbours be used? (very slow)</dd>


<dt>proportions</dt>
<dd>to which proportion (of the biggest class) should the classes be equalized</dd>


<dt>equalise_with_undersampling</dt>
<dd>should additional undersampling be performed?</dd>


<dt>safe</dt>
<dd>should a safe version of SMOTE be used?</dd>

</dl>

Ilya Kozlovskiy, Konstantin Hopf <a href='mailto:konstantin.hopf@uni-bamberg.de'>konstantin.hopf@uni-bamberg.de</a>

smote: Synthetic minority oversampling (SMOTE)

Description

Usage

Value

Arguments

Author

Details