Learn R Programming

eeptools (version 0.9.1)

stuatt: Student Attributes from the Strategic Data Project Toolkit

Description

A synthetic dataset of student attributes from the Strategic Data Project which includes records with errors to practice data cleaning and impelmenting business rules for consistency in data.

Usage

stuatt

Arguments

Format

A data frame with 87534 observations on the following 9 variables.
sid
a numeric vector of the unique student ID
school_year
a numeric vector of the school year
male
a numeric vector indicating 1 = male
race_ethnicity
a factor with levels A B H M/O W
birth_date
a numeric vector of the student birthdate
first_9th_school_year_reported
a numeric vector of the first year a student is reported in 9th grade
hs_diploma
a numeric vector
hs_diploma_type
a factor with levels Alternative Diploma College Prep Diploma Standard Diploma
hs_diploma_date
a factor with levels 12/2/2008 12/21/2008 4/14/2008 4/18/2008 ...

Source

Available from the Strategic Data Project online at http://sdp.cepr.harvard.edu/toolkit-effective-data-use

Details

This is the non-clean version of the data to allow for implementing business rules to clean data.

References

Visit the Strategic Data Project online at: http://sdp.cepr.harvard.edu/

Examples

Run this code
data(stuatt)
head(stuatt)

Run the code above in your browser using DataLab