designmatch (version 0.5.4)

lalonde: Lalonde data set

Description

This is one of the data sets from the National Supported Work Demonstration used by Dehejia and Wahba (1999) to evaluate propensity score matching methods. This and other related data sets are available at https://users.nber.org/~rdehejia/nswdata2.html.

Usage

data(lalonde)

Format

A data frame with 445 observations, corresponding to 185 treated and 260 control subjects, and 10 variables. The treatment assignment indicator is the first variable of the data frame: treatment (1 = treated; 0 = control). The next 8 columns are the covariates:

  • age, measured in years;

  • education, measured in years;

  • black, indicating race (1 if black, 0 otherwise);

  • hispanic, indicating ethnicity (1 if Hispanic, 0 otherwise);

  • married, indicating marital status (1 if married, 0 otherwise);

  • nodegree, indicating the absence of a high school diploma (1 if no degree, 0 otherwise);

  • re74, real earnings in 1974;

  • re75, real earnings in 1975.

The last variable of the data frame is re78, the real earnings in 1978.
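Examples

The layout described above can be checked after loading the data. This is a minimal sketch, assuming the designmatch package is installed. The name covariate_means is introduced here for illustration only, and the values in the comments restate this help page rather than captured output.

```r
# Assumption: the designmatch package is installed.
if (requireNamespace("designmatch", quietly = TRUE)) {
  data(lalonde, package = "designmatch")

  # Dimensions: documented as 445 observations and 10 variables.
  print(dim(lalonde))

  # Group sizes: documented as 260 controls (0) and 185 treated (1).
  print(table(lalonde$treatment))

  # Column order: treatment first, then the covariates, then re78 last.
  print(names(lalonde))

  # Covariate means by treatment group (columns 2 to 9 hold the covariates).
  covariate_means <- aggregate(lalonde[, 2:9],
                               by = list(treatment = lalonde$treatment),
                               FUN = mean)
  print(covariate_means)
}
```

Comparing covariate means across the two groups is a common first look at balance before any matching is attempted.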

References

Dehejia, R., and Wahba, S. (1999), "Causal Effects in Nonexperimental Studies: Reevaluating the Evaluation of Training Programs," Journal of the American Statistical Association, 94, 1053-1062.

Lalonde, R. (1986), "Evaluating the Econometric Evaluations of Training Programs," American Economic Review, 76, 604-620.