Learn R Programming

VSURF (version 1.2.0)

Variable Selection Using Random Forests

Description

Three steps variable selection procedure based on random forests. Initially developed to handle high dimensional data (for which number of variables largely exceeds number of observations), the package is very versatile and can treat most dimensions of data, for regression and supervised classification problems. First step is dedicated to eliminate irrelevant variables from the dataset. Second step aims to select all variables related to the response for interpretation purpose. Third step refines the selection by eliminating redundancy in the set of variables selected by the second step, for prediction purpose. Genuer, R. Poggi, J.-M. and Tuleau-Malot, C. (2015) .

Copy Link

Version

Install

install.packages('VSURF')

Monthly Downloads

560

Version

1.2.0

License

GPL (>= 2)

Issues

Pull Requests

Stars

Forks

Maintainer

Robin Genuer

Last Published

December 15th, 2022

Functions in VSURF (1.2.0)

PM10

Real-world data on PM10 pollution in Rouen area, France
print.VSURF

Print of VSURF results
predict.VSURF

Predict method for VSURF object
VSURF

Variable Selection Using Random Forests
VSURF_interp

Interpretation step of VSURF
VSURF_thres

Thresholding step of VSURF
VSURF_pred

Prediction step of VSURF
plot.VSURF

Plot of VSURF results
toys

A simulated dataset called toys data
summary.VSURF

Summary of VSURF results
tune

Tuning of the thresholding and interpretation steps of VSURF