eeptools (version 1.2.2)

stuatt: Student Attributes from the Strategic Data Project Toolkit

Description

A synthetic dataset of student attributes from the Strategic Data Project which includes records with errors to practice data cleaning and implementing business rules for consistency in data.

Usage

stuatt

Arguments

Format

A data frame with 87534 observations on the following 9 variables.

sid

a numeric vector of the unique student ID

school_year

a numeric vector of the school year

male

a numeric vector indicating 1 = male

race_ethnicity

a factor with levels A B H M/O W

birth_date

a numeric vector of the student birthdate

first_9th_school_year_reported

a numeric vector of the first year a student is reported in 9th grade

hs_diploma

a numeric vector

hs_diploma_type

a factor with levels Alternative Diploma College Prep Diploma Standard Diploma

hs_diploma_date

a factor with levels 12/2/2008 12/21/2008 4/14/2008 4/18/2008 ...

Details

This is the non-clean version of the data to allow for implementing business rules to clean data.

References

Visit the Strategic Data Project online at: http://sdp.cepr.harvard.edu/

Examples

Run this code
# NOT RUN {
data(stuatt)
head(stuatt)
# }

Run the code above in your browser using DataCamp Workspace