One of the datasets used by Dehejia and Wahba in their paper "Causal Effects in Non-Experimental Studies: Reevaluating the Evaluation of Training Programs." Also used as an example dataset in the MatchIt package.
data("lalonde")
A data frame with 614 observations on the following 10 variables.
treat
treatment indicator; 1 if treated in the National Supported Work Demonstration, 0 if from the Current Population Survey
age
age, a numeric vector.
educ
years of education, a numeric vector between 0 and 18.
black
a binary vector, 1 if black
, 0 otherwise.
hispan
a binary vector, 1 if hispanic
, 0 otherwise.
married
a binary vector, 1 if married
, 0 otherwise.
nodegree
a binary vector, 1 if no degree, 0 otherwise.
re74
earnings in 1974, a numeric vector.
re75
earnings in 1975, a numeric vector.
re78
earnings in 1978, a numeric vector (outcome variable).
This data set has been taken from twang
package, with small changes to field descriptions.
Lalonde, R. (1986). Evaluating the econometric evaluations of training programs with experimental data. American Economic Review 76: 604-620.
Dehejia, R.H. and Wahba, S. (1999). Causal Effects in Nonexperimental Studies: Re-Evaluating the Evaluation of Training Programs. Journal of the American Statistical Association 94: 1053-1062.