Data set from the National Supported Work Demonstration used by Lalonde (1986) and Dehejia and Wahba (1999) to evaluate propensity score methods. This data set is publicly available at https://users.nber.org/~rdehejia/data/.nswdata2.html.
data(lalonde)
A data frame with 445 observations, corresponding to 185 treated and 260 control subjects, and 10 variables. The treatment assignment indicator is the first variable of the data frame; the next eight columns are the covariates; the last column is the outcome:
the treatment assignment indicator (1 if treated, 0 otherwise)
a covariate, age measured in years
a covariate, education measured in years of schooling
a covariate indicating race (1 if black, 0 otherwise)
a covariate indicating race (1 if Hispanic, 0 otherwise)
a covariate indicating marital status (1 if married, 0 otherwise)
a covariate indicating the absence of a high school diploma (1 if no degree, 0 otherwise)
a covariate, real earnings in 1974
a covariate, real earnings in 1975
the outcome, real earnings in 1978
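The column layout described above can be checked with a short sketch. The helper name `summarize_layout` is hypothetical and not part of any package; it addresses columns by position only (column 1 as the treatment indicator, column 10 as the outcome), so it makes no assumption about the variable names. The `toy` data frame holds made-up values purely to show the call and is not the real data.

```r
# Hypothetical helper: summarizes a data frame laid out as documented
# (column 1 = treatment indicator, columns 2-9 = covariates, column 10 = outcome).
summarize_layout <- function(df) {
  stopifnot(is.data.frame(df), ncol(df) == 10)
  treat   <- df[[1]]
  outcome <- df[[10]]
  stopifnot(all(treat %in% c(0, 1)))
  list(
    n_treated  = sum(treat == 1),
    n_control  = sum(treat == 0),
    # Unadjusted difference in mean outcome, treated minus control
    diff_means = mean(outcome[treat == 1]) - mean(outcome[treat == 0])
  )
}

# Toy data frame with the same 10-column layout (made-up values, not the real data)
toy <- as.data.frame(matrix(0, nrow = 4, ncol = 10))
toy[[1]]  <- c(1, 1, 0, 0)      # treatment indicator
toy[[10]] <- c(10, 20, 5, 15)   # outcome

summarize_layout(toy)
```

Assuming the package that ships this data set is attached, running `data(lalonde)` followed by `summarize_layout(lalonde)` should report 185 treated and 260 control subjects if the data frame matches the description. The difference in means is an unadjusted comparison only, not a propensity score estimate.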
Dehejia, R., and Wahba, S. (1999), "Causal Effects in Nonexperimental Studies: Reevaluating the Evaluation of Training Programs," Journal of the American Statistical Association, 94, 1053-1062.
Lalonde, R. (1986), "Evaluating the Econometric Evaluations of Training Programs," American Economic Review, 76, 604-620.