SplitUplift

a data frame of interest that contains at least the response and the treatment variables.

data

The desired sample size. p is a value between 0 and 1 expressed as a decimal, it is set to be proportional to the number of observations per group.

Your grouping variables. Generally, for uplift modelling, this should be a vector of treatment and response variables names, e.g. c("treat", "y").

group

Split a dataset into training and validation subsets with respect to the uplift sample distribution.

Sampling

Uplift modeling aims at predicting the causal effect of an action such as a medical
treatment or a marketing campaign on a particular individual, by taking into consideration
the response to a treatment. In order to simplify the task for practitioners in uplift modeling,
we propose a combination of tools that can be separated into the following ingredients:
i) quantization, ii) visualization, iii) feature engineering, iv) feature selection and,
v) model validation. For more details, please read Belbahri et Al. (2019)
<https://dms.umontreal.ca/~murua/research/UpliftRegression.pdf>.

SplitUplift: Split data with respect to uplift distribution

Description

Usage

Arguments

Value

References

Examples