Data imputation for the merged ENMO data with annotation. The missing values were imputated by the average ENMO over all the valid days for each subject.
data.imputation(workdir, csvInput)
character
Directory where the output needs to be stored. Note that this directory must exist.
character
File name with or without directory for sample information in CSV format. The ENMO data will be read through read.csv(csvInput,header=1) command, and the missing values were imputated by the average ENMO over all the valid days for each subject at each time point. In this package, csvInput = flag_All_studyname_ENMO.data.Xs.csv.
Files were written to the specified sub-directory, named as impu.flag_All_studyname_ENMO.data.Xs.csv, which Xs is the epoch size to which acceleration was averaged (seconds) in GGIR output. This excel file includs the following columns,
accelerometer file name
date recored from the GGIR part2.summary file
IDs recored from the GGIR part2.summary file
date in the format of yyyy-mm-dd
number of hours with valid data recored from the part2_daysummary.csv file in the GGIR output
number of hours of measurement recored from the part2_daysummary.csv file in the GGIR output
day of the week-Day of the week
day of measurement-Day number relative to start of the measurement
new IDs defined as the user-defined function of filename2id(), e.g. substrings of the filename
number of NAs from the 9th to 31th column in the part2_daysummary.csv file in the GGIR output
"M" indicates missing for an invalid day, and "C" indicates completeness for a valid day
number of days of measurement
rank of the measurementday, for example, the value is 1,2,3,4,-3,-2,-1 for measurementday = 1,...,7
number of missing (invalid) days
number of non-missing (valid) days
indicators of missing/nonmissing for all measurement days at the subject level
number of columnns in the non-wearing matrix
number of minutes of non-wearing
If 0 then the person is a nightsleeper (sleep period did not overlap with noon) if value=1 then the person is a daysleeper (sleep period did overlap with noon).
indicator of a key qulity control output. If remove16h7day=1, the day need to be removed. If remove16h7day=0, the day need to be kept.
If duplicate="remove", the accelerometer files will not be used in the data analysis of part5.
number of missing values on the ENMO data before imputation
number of missing values on the ENMO data after imputation
The value is "keep"/"remove", e.g. KEEP="remove" if remove16h7day=1 or duplicate="remove" or ImpuMiss.a>0