gains (version 1.2)

MineThatData: MineThatData E-Mail Analytics Challenge Data With Predictions

Description

This data set contains information about purchases from an apparel company during a two-week response window. It is based on a data set used for an analytics challenge on the MineThatData blog in 2008. Predictions from two binary response models and from two models of spend conditional on response are included, along with a flag marking whether each customer was used in the training sample for the models.

Usage

MineThatData

Format

A data frame with 64,000 rows and 7 columns:

  • conversion: 0/1 indicator of whether the customer purchased merchandise in the two-week response window.

  • spend: Amount spent in dollars during the two-week response window.

  • train: 0/1 indicator of whether the observation was used to construct the predictive models.

  • logistic.score: Estimated response probability from a logistic regression.

  • svm.score: Estimated response probability from a support vector machine.

  • linear.score: Estimated "spend", conditional on purchase, from a linear regression.

  • rf.score: Estimated "spend", conditional on purchase, from a random forest.
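
Examples

Because train marks the observations used to fit the models, a fair comparison of the four scores uses only the rows with train equal to 0. The sketch below is one possible way to do that and is not part of the gains package: holdout_summary and its top_frac argument are hypothetical names introduced here for illustration. It reports the conversion rate among the highest-scored holdout customers for each response model, and the root mean squared error of each spend model among holdout purchasers.

```r
# Hypothetical helper (not provided by the gains package).
# Expects a data frame with the seven columns described under Format.
holdout_summary <- function(df, top_frac = 0.1) {
  # Keep only observations that were NOT used to build the models.
  holdout <- df[df$train == 0, , drop = FALSE]
  # The spend scores are conditional on purchase, so judge them on buyers only.
  buyers <- holdout[holdout$conversion == 1, , drop = FALSE]

  # Conversion rate among the top `top_frac` of holdout customers by score.
  top_rate <- function(score) {
    n_top <- max(1, floor(top_frac * length(score)))
    top <- order(score, decreasing = TRUE)[seq_len(n_top)]
    mean(holdout$conversion[top])
  }

  # Root mean squared error of a spend prediction among holdout purchasers.
  rmse <- function(pred) sqrt(mean((buyers$spend - pred)^2))

  list(
    n_holdout  = nrow(holdout),
    base_rate  = mean(holdout$conversion),
    top_rate   = sapply(holdout[c("logistic.score", "svm.score")], top_rate),
    spend_rmse = sapply(buyers[c("linear.score", "rf.score")], rmse)
  )
}

# Usage, assuming the gains package is installed.
if (requireNamespace("gains", quietly = TRUE)) {
  data("MineThatData", package = "gains")
  str(holdout_summary(MineThatData))
}
```

A top_rate value well above base_rate suggests the response score ranks purchasers ahead of non-purchasers; a lower spend_rmse suggests the spend score tracks the observed amounts more closely. Neither figure replaces a full gains table.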