This dataset was taken from the UCI Machine Learning Repository (https://archive.ics.uci.edu/ml/datasets/Auto+MPG) and processed for the semi-supervised setting (Ryan and Culp, 2015).
data("auto_mpg")
There are two lists that contain partitions from a data frame with 398 observations on the following 9 variables.
mpg: a numeric vector
cylinders: an ordered factor with levels 3 < 4 < 5 < 6 < 8
displacement: a numeric vector
horsepower: a numeric vector
weight: a numeric vector
acceleration: a numeric vector
year: a numeric vector
origin: a factor
This dataset is a slightly modified version of the dataset provided in the StatLib library. In line with the use by Ross Quinlan (1993) in predicting the attribute "mpg", 8 of the original instances were removed because they had unknown values for the "mpg" attribute. "The data concerns city-cycle fuel consumption in miles per gallon, to be predicted in terms of 3 multivalued discrete and 5 continuous attributes." (Quinlan, 1993)
Ryan, K. J., & Culp, M. V. (2015). On semi-supervised linear regression in covariate shift problems. The Journal of Machine Learning Research, 16(1), 3183-3217.
data(auto_mpg)
head(auto_mpg$P1$xL)