One of the datasets used by Dehejia and Wahba in their paper "Causal Effects in Nonexperimental Studies: Reevaluating the Evaluation of Training Programs." It is also used as an example dataset in the MatchIt package.
data("lalonde")
A data frame with 614 observations on the following 10 variables.
treat: treatment indicator; 1 if treated in the National Supported Work Demonstration, 0 if from the Current Population Survey.
age: age, a numeric vector.
educ: years of education, a numeric vector between 0 and 18.
black: a binary vector, 1 if black, 0 otherwise.
hispan: a binary vector, 1 if hispanic, 0 otherwise.
married: a binary vector, 1 if married, 0 otherwise.
nodegree: a binary vector, 1 if no degree, 0 otherwise.
re74: earnings in 1974, a numeric vector.
re75: earnings in 1975, a numeric vector.
re78: earnings in 1978, a numeric vector (outcome variable).
This data set has been taken from the twang package, with small changes to the field descriptions.
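As a minimal sketch of how the data might be loaded and inspected, the following assumes the twang package is installed and that its copy of lalonde matches the description given here (614 rows, 10 variables). The unadjusted difference in means at the end is only illustrative and is not a causal estimate.

```r
# Assumption: the twang package is installed and ships this data set.
data("lalonde", package = "twang")

# Inspect the structure; 614 rows and the 10 documented variables are expected.
str(lalonde)

# Group sizes by treatment status (1 = treated, 0 = comparison).
table(lalonde$treat)

# Naive, unadjusted difference in mean 1978 earnings (treated minus comparison).
# The two groups differ on covariates, so this is not a causal effect.
mean(lalonde$re78[lalonde$treat == 1]) - mean(lalonde$re78[lalonde$treat == 0])
```

If the data set is loaded from a different package, the variable names and coding may differ from those listed here, so it is worth checking the output of str() first.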
Lalonde, R. (1986). Evaluating the econometric evaluations of training programs with experimental data. American Economic Review 76: 604-620.
Dehejia, R.H. and Wahba, S. (1999). Causal Effects in Nonexperimental Studies: Reevaluating the Evaluation of Training Programs. Journal of the American Statistical Association 94: 1053-1062.