The first 5 variables are all blood tests which are thought to be sensitive to liver disorders that might arise from excessive alcohol consumption. Each line in the dataset constitutes the record of a single male individual.
data(bupa)
A data frame with 345 rows and 7 variables:
mean corpuscular volume
alkaline phosphotase
alanine aminotransferase
aspartate aminotransferase
gamma-glutamyl transpeptidase
number of half-pint equivalents of alcoholic beverages drunk per day
field created by the BUPA researchers to split the data into train/test sets