Data frame with known relationship between responses and predictors useful to illustrate multicollinearity concepts. Created from vi using the code shown in the example.
data(toy)Data frame with 2000 rows and 5 columns.
Columns:
y: response variable generated from a * 0.75 + b * 0.25 + noise.
a: most important predictor of y, uncorrelated with b.
b: second most important predictor of y, uncorrelated with a.
c: generated from a + noise.
d: generated from (a + b)/2 + noise.
These are variance inflation factors of the predictors in toy.
variable vif
b 4.062
d 6.804
c 13.263
a 16.161
Other example_data:
vi,
vi_predictors,
vi_predictors_categorical,
vi_predictors_numeric