stuatt: Student Attributes from the Strategic Data Project Toolkit
Description
A synthetic dataset of student attributes from the Strategic Data
Project which includes records with errors to practice data cleaning and
implementing business rules for consistency in data.
Usage
stuatt
Arguments
Format
A data frame with 87534 observations on the following 9 variables.
sid
a numeric vector of the unique student ID
school_year
a numeric vector of the school year
male
a numeric vector indicating 1 = male
race_ethnicity
a factor with levels ABHM/OW
birth_date
a numeric vector of the student birthdate
first_9th_school_year_reported
a numeric vector of the first year a student is reported in 9th grade
hs_diploma
a numeric vector
hs_diploma_type
a factor with levels Alternative DiplomaCollege Prep DiplomaStandard Diploma
hs_diploma_date
a factor with levels 12/2/200812/21/20084/14/20084/18/2008 ...
Details
This is the non-clean version of the data to allow for implementing
business rules to clean data.