This dataset was taken from the UCI Machine Learning Repository https://archive.ics.uci.edu/ml/datasets/Auto+MPG, and processed for the semi-supervised setting (Ryan and Culp, 2015).
data("auto_mpg")
There are two lists that contain partitions from a data frame with 398 observations on the following 9 variables.
mpg
a numeric vector
cylinders
an ordered factor with levels 3
< 4
< 5
< 6
< 8
displacement
a numeric vector
horsepower
a numeric vector
weight
a numeric vector
acceleration
a numeric vector
year
a numeric vector
origin
a factor
This dataset is a slightly modified version of the dataset provided in the StatLib library. In line with the use by Ross Quinlan (1993) in predicting the attribute "mpg", 8 of the original instances were removed because they had unknown values for the "mpg" attribute. "The data concerns city-cycle fuel consumption in miles per gallon, to be predicted in terms of 3 multivalued discrete and 5 continuous attributes." (Quinlan, 1993)
Ryan, K. J., & Culp, M. V. (2015). On semi-supervised linear regression in covariate shift problems. The Journal of Machine Learning Research, 16(1), 3183-3217.
data(auto_mpg)
head(auto_mpg$P1$xL)
Run the code above in your browser using DataLab