A training dataset for diagnostic models, containing sample IDs, binary outcomes, and gene expression features.
train_dia
A data frame with one row per sample and 22 columns:
character. Unique identifier for each sample.
integer. The binary outcome, where 1 typically represents a positive case and 0 a negative case.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
numeric. Gene expression level.
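This dataset is used to train machine learning models for diagnosis. Columns whose names begin with prefixes such as 'AC', 'AL', or 'LINC' are the gene expression feature variables; the remaining two columns hold the sample identifier and the binary outcome.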
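
As a rough illustration of how a data frame with this layout might be used, the sketch below builds a small stand-in table and fits a logistic regression with base R. The column names (`sample`, `outcome`, `AC000001.1`, `AL000002.1`, `LINC00001`) and all values are hypothetical placeholders, not taken from the real dataset, and logistic regression is only one possible choice of model.

```r
# Stand-in data with the documented layout: an ID column, a binary outcome,
# and numeric gene expression features. All names and values are hypothetical.
set.seed(1)
n <- 40
toy <- data.frame(
  sample     = paste0("S", seq_len(n)),          # hypothetical ID column name
  outcome    = rep(c(0L, 1L), length.out = n),   # hypothetical outcome column name
  AC000001.1 = rnorm(n),
  AL000002.1 = rnorm(n),
  LINC00001  = rnorm(n),
  stringsAsFactors = FALSE
)

# Treat every column except the ID and the outcome as a feature.
feature_cols <- setdiff(names(toy), c("sample", "outcome"))

# Fit a logistic regression of the outcome on all feature columns.
fit <- glm(reformulate(feature_cols, response = "outcome"),
           data = toy, family = binomial())

# Predicted probability of a positive case (outcome == 1) for each sample.
pred <- predict(fit, type = "response")
```

With the real data, `train_dia` would take the place of `toy`, and the ID and outcome column names would need to be adjusted to match the actual columns.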