Learn R Programming

RfEmpImp (version 2.1.8)

Multiple Imputation using Chained Random Forests

Description

An R package for multiple imputation using chained random forests. Implemented methods can handle missing data in mixed types of variables by using prediction-based or node-based conditional distributions constructed using random forests. For prediction-based imputation, the method based on the empirical distribution of out-of-bag prediction errors of random forests and the method based on normality assumption for prediction errors of random forests are provided for imputing continuous variables. And the method based on predicted probabilities is provided for imputing categorical variables. For node-based imputation, the method based on the conditional distribution formed by the predicting nodes of random forests, and the method based on proximity measures of random forests are provided. More details of the statistical methods can be found in Hong et al. (2020) .

Copy Link

Version

Install

install.packages('RfEmpImp')

Monthly Downloads

183

Version

2.1.8

License

GPL-3

Issues

Pull Requests

Stars

Forks

Maintainer

Shangzhi Hong

Last Published

October 20th, 2022

Functions in RfEmpImp (2.1.8)

gen.mcar

Generate missing (completely at random) cells in a data set
imp.rfnode.prox

Perform multiple imputation based on the conditional distribution formed using node proximity
conv.factor

Convert variables to factors
imp.rfnode.cond

Perform multiple imputation based on the conditional distribution formed by prediction nodes of random forests
mice.impute.rfpred.emp

Univariate sampler function for continuous variables using the empirical error distributions
mice.impute.rfemp

Univariate sampler function for mixed types of variables for prediction-based imputation, using empirical distribution of out-of-bag prediction errors and predicted probabilities of random forests
query.rf.pred.idx

Identify corresponding observations indexes under the terminal nodes for a random forest model by ranger
RfEmpImp-package

RfEmpImp: Multiple Imputation using Chained Random Forests
mice.impute.rfpred.cate

Univariate sampler function for categorical variables for prediction-based imputation, using predicted probabilities of random forest
imp.rfemp

Perform multiple imputation using the empirical error distributions and predicted probabilities of random forests
mice.impute.rfpred.norm

Univariate sampler function for continuous variables for prediction-based imputation, assuming normality for prediction errors of random forest
rangerCallerSafe

Remove unnecessary arguments for ranger function
mice.impute.rfnode

Univariate sampler function for mixed types of variables for node-based imputation, using predicting nodes of random forests
reg.ests

Get regression estimates for pooled object
query.rf.pred.val

Identify corresponding observed values for the response variable under the terminal nodes for a random forest model by ranger