Adapted from the KDD-CUP-98 data set on donations made to a national veterans organization.
data("DONOR")
A data frame with 19372 observations on the following 50 variables.
Donate: a factor with levels No Yes
Donation.Amount: a numeric vector
ID: a numeric vector
MONTHS_SINCE_ORIGIN: a numeric vector, number of months donor has been in the database
DONOR_AGE: a numeric vector
IN_HOUSE: a numeric vector, 1 if person has donated to the charity's "In House" program
URBANICITY: a factor with levels ? C R S T U
SES: a factor with levels ? 1 2 3 4, one of five possible codes indicating socioeconomic status
CLUSTER_CODE: a factor with levels . 01 02 ... 53, one of 54 possible cluster codes, which are unique in terms of socioeconomic status, urbanicity, ethnicity, and other demographic characteristics
HOME_OWNER: a factor with levels H U
DONOR_GENDER: a factor with levels A F M U
INCOME_GROUP: a numeric vector, but in reality a code for one of 7 possible income groups inferred from demographics
PUBLISHED_PHONE: a numeric vector, listed (1) vs not listed (0)
OVERLAY_SOURCE: a factor with levels B M N P, source against which the donor was matched; B is both sources and N is neither
MOR_HIT_RATE: a numeric vector, number of known times donor has responded to a mailed solicitation from a group other than the charity
WEALTH_RATING: a numeric vector, but in reality a code for one of 10 groups based on demographics
MEDIAN_HOME_VALUE: a numeric vector, inferred from other variables
MEDIAN_HOUSEHOLD_INCOME: a numeric vector, inferred from other variables
PCT_OWNER_OCCUPIED: a numeric vector, percent of owner-occupied housing near where person lives
PER_CAPITA_INCOME: a numeric vector, per capita income of the neighborhood in which person lives
PCT_ATTRIBUTE1: a numeric vector, percent of residents in person's neighborhood that are male and active military
PCT_ATTRIBUTE2: a numeric vector, percent of residents in person's neighborhood that are male and veterans
PCT_ATTRIBUTE3: a numeric vector, percent of residents in person's neighborhood that are Vietnam veterans
PCT_ATTRIBUTE4: a numeric vector, percent of residents in person's neighborhood that are WW2 veterans
PEP_STAR: a numeric vector, 1 if the donor has achieved STAR donor status and 0 otherwise
RECENT_STAR_STATUS: a numeric vector, 1 if the donor achieved STAR status within the last 4 years
RECENCY_STATUS_96NK: a factor with levels A (active) E (inactive) F (first time) L (lapsing) N (new) S (star donor), as of 1996
FREQUENCY_STATUS_97NK: a numeric vector indicating number of times donated in last period (the period is determined by RECENCY_STATUS_96NK)
RECENT_RESPONSE_PROP: a numeric vector, ratio of the individual's responses to the number of (card or other) solicitations from the charitable organization in the last four years
RECENT_AVG_GIFT_AMT: a numeric vector, average donation from the individual to the charitable organization in the last four years
RECENT_CARD_RESPONSE_PROP: a numeric vector, ratio of the individual's responses to the number of card solicitations from the charitable organization in the last four years
RECENT_AVG_CARD_GIFT_AMT: a numeric vector, average donation from the individual in response to a card solicitation from the charitable organization in the last four years
RECENT_RESPONSE_COUNT: a numeric vector, number of times the individual has responded to a promotion (card or other) from the charitable organization in the last four years
RECENT_CARD_RESPONSE_COUNT: a numeric vector, number of times the individual has responded to a card solicitation from the charitable organization in the last four years
MONTHS_SINCE_LAST_PROM_RESP: a numeric vector, number of months since the individual has responded to a promotion by the charitable organization
LIFETIME_CARD_PROM: a numeric vector, total number of card promotions sent to the individual by the charitable organization
LIFETIME_PROM: a numeric vector, total number of promotions sent to the individual by the charitable organization
LIFETIME_GIFT_AMOUNT: a numeric vector, total lifetime donation amount from the individual to the charitable organization
LIFETIME_GIFT_COUNT: a numeric vector, total number of donations from the individual to the charitable organization
LIFETIME_AVG_GIFT_AMT: a numeric vector, lifetime average donation from the individual to the charitable organization
LIFETIME_GIFT_RANGE: a numeric vector, difference between maximum and minimum donation amounts from the individual
LIFETIME_MAX_GIFT_AMT: a numeric vector
LIFETIME_MIN_GIFT_AMT: a numeric vector
LAST_GIFT_AMT: a numeric vector
CARD_PROM_12: a numeric vector, number of card promotions sent to the individual by the charitable organization in the last 12 months
NUMBER_PROM_12: a numeric vector, number of promotions (card or other) sent to the individual by the charitable organization in the last 12 months
MONTHS_SINCE_LAST_GIFT: a numeric vector
MONTHS_SINCE_FIRST_GIFT: a numeric vector
FILE_AVG_GIFT: a numeric vector, same as LIFETIME_AVG_GIFT_AMT
FILE_CARD_GIFT: a numeric vector, lifetime average donation from the individual in response to all card solicitations from the charitable organization
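Some of the variables above carry placeholder levels (? in URBANICITY and SES), and INCOME_GROUP is stored as a number although it is a group code. The following is a minimal sketch of one possible way to recode such columns before modeling. It uses a small made-up data frame, toy, rather than the real DONOR rows; the helper recode_unknown is hypothetical, and both the reading of ? as "unknown" and the assumption that income groups are coded 1 through 7 are guesses that should be checked against the data.

```r
# Toy stand-in for a few DONOR columns; the values are made up for illustration.
toy <- data.frame(
  URBANICITY   = factor(c("C", "?", "R", "S", "U")),
  SES          = factor(c("1", "2", "?", "4", "3")),
  INCOME_GROUP = c(1, 7, 4, 2, 5)
)

# Assumption: "?" marks an unknown value. Replace it with NA and drop the
# now-empty level. (recode_unknown is a hypothetical helper, not part of any package.)
recode_unknown <- function(f, unknown = "?") {
  f[f == unknown] <- NA
  droplevels(f)
}

toy$URBANICITY <- recode_unknown(toy$URBANICITY)
toy$SES        <- recode_unknown(toy$SES)

# Assumption: the 7 income groups are coded 1..7. An ordered factor keeps the
# ordering without implying equal spacing between groups.
toy$INCOME_GROUP <- factor(toy$INCOME_GROUP, levels = 1:7, ordered = TRUE)

str(toy)
```

Whether to keep the unknown codes as their own level or convert them to NA depends on the model; some fitting functions silently drop rows containing NA.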
Originally, these data were used in the 1998 KDD competition (https://kdd.ics.uci.edu/databases/kddcup98/kddcup98.html). This particular version has been adapted from the version available in SAS Enterprise Miner (see Appendix 2 of http://support.sas.com/documentation/cdl/en/emgsj/61207/PDF/default/emgsj.pdf for descriptions of variable names). One goal is to determine whether a past donor donated in response to the 97NK mail solicitation and, if so, how much, based on age, gender, most recent donation amount, total gift amount, etc.
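The stated goal (did the donor give, and if so how much) can be approached in two stages. The sketch below is one possible way to do this and is not taken from the original source: a logistic regression for Donate, followed by a linear regression for Donation.Amount fitted only to those who donated. To stay self-contained it simulates a toy data frame with a few of the column names listed above; all values and coefficients are made up, and the choice to code Donation.Amount as NA for non-donors is an assumption.

```r
set.seed(1)
n <- 200

# Simulated stand-in for a few DONOR columns (made-up values).
toy <- data.frame(
  DONOR_AGE            = round(runif(n, 20, 85)),
  LAST_GIFT_AMT        = round(runif(n, 5, 50), 2),
  LIFETIME_GIFT_AMOUNT = round(runif(n, 10, 500), 2)
)

# Made-up response: chance of donating rises slightly with the last gift amount.
p <- plogis(-1 + 0.02 * toy$LAST_GIFT_AMT)
toy$Donate <- factor(ifelse(runif(n) < p, "Yes", "No"), levels = c("No", "Yes"))

# Assumption: non-donors have a missing donation amount.
toy$Donation.Amount <- ifelse(
  toy$Donate == "Yes",
  pmax(1, round(0.8 * toy$LAST_GIFT_AMT + rnorm(n, 0, 3), 2)),
  NA
)

# Stage 1: probability of donating. With a factor response, glm treats the
# first level ("No") as failure, so this models P(Donate == "Yes").
fit_donate <- glm(Donate ~ DONOR_AGE + LAST_GIFT_AMT + LIFETIME_GIFT_AMOUNT,
                  data = toy, family = binomial)

# Stage 2: donation amount, fitted only to the donors.
fit_amount <- lm(Donation.Amount ~ DONOR_AGE + LAST_GIFT_AMT + LIFETIME_GIFT_AMOUNT,
                 data = subset(toy, Donate == "Yes"))

# Combine the two stages into an expected gift for some donors.
new_donors    <- toy[1:3, ]
p_yes         <- predict(fit_donate, newdata = new_donors, type = "response")
amount_if_yes <- predict(fit_amount, newdata = new_donors)
expected_gift <- p_yes * amount_if_yes
expected_gift
```

Multiplying the two predictions gives a rough expected donation per mailing, which can be compared against the cost of a solicitation. Other classifiers or regressors could be substituted at either stage.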