Load and formats the student registration data set from the OULAD for data analysis.
dataset_registration(
module = c("All", "AAA", "BBB", "CCC", "DDD", "EEE", "FFF", "GGG"),
presentation = c("2013B", "2014B", "2013J", "2014J", "All", "Summer", "Winter"),
repeat_students = c("remove", "keep")
)Returns one tibble (object of class tbl_df), called 'studentRegistration', based on the OULAD studentRegistration.csv file
and the specified inputs (module, presentation, and repeat_students).
The tibble consists of five columns (Kuzilek et al., 2017):
id_student - the unique student identification number.
code_module - module identification code.
code_presentation - module presentation identification code.
date_registration - the day of student’s registration for the module presentation. Modules start on day 0.
date_unregistration - the day of student unregistration from the module presentation. This is NA if the student completed the module presentation.
name of the module to be included, either "All", "AAA", "BBB", "CCC", "DDD", "EEE", "FFF" or "GGG".
name of the semester of the module to be included, either "2013B",
"2014B", "2013J", "2014J", "All", "Summer" or "Winter".
"B" indicates a February start time whereas "J" indicates an October start time. "All" indicates
that all presentations of the module will be included in the returned data. Where possible, "Summer" returns
both "2013B" and "2014B", and "Winter" returns both "2013J" and "2014J".
indicator of whether students who had previous attempts at the module should be removed, either "remove" or "keep".
Kuzilek, J., Hlosta, M., & Zdrahal, Z. (2017). Open university learning analytics data set. Scientific Data volume 4 , (pp. 1–8). https://doi.org/10.1038/sdata.2017.171.
combined_dataset()
dataset_registration(module = "BBB", presentation = "2013J", repeat_students = "remove")
Run the code above in your browser using DataLab