missForest v1.4


Monthly downloads



Nonparametric Missing Value Imputation using Random Forest

The function 'missForest' in this package is used to impute missing values particularly in the case of mixed-type data. It uses a random forest trained on the observed values of a data matrix to predict the missing values. It can be used to impute continuous and/or categorical data including complex interactions and non-linear relations. It yields an out-of-bag (OOB) imputation error estimate without the need of a test set or elaborate cross-validation. It can be run in parallel to save computation time.



missForest is a nonparametric, mixed-type imputation method for basically any type of data.
Here, we host the R-package "missForest" for the statistical software R.

The method is based on the publication Stekhoven and Bühlmann, 2012. The R package contains a vignette on how to use "missForest" in R including many helpful examples. Upcoming innovations:

  • use of prediction
  • 'real' multiple imputation

Contact me by email: stekhoven@quantik.ch References: Stekhoven, D.J. and Buehlmann, P. (2012), 'MissForest - nonparametric missing value imputation for mixed-type data', Bioinformatics, 28(1) 2012, 112-118, doi: 10.1093/bioinformatics/btr597

Functions in missForest

Name Description
missForest-package Nonparametric Missing Value Imputation using Random Forest
varClass Extract Variable Types from a Dataframe
prodNA Introduce Missing Values Completely at Random
mixError Compute Imputation Error for Mixed-type Data
nrmse Normalized Root Mean Squared Error
missForest Nonparametric Missing Value Imputation using Random Forest
No Results!

Last month downloads


Type Package
Date 2013-12-31
License GPL (>= 2)
URL http://www.r-project.org, https://github.com/stekhoven/missForest
Packaged 2013-12-31 14:28:06 UTC; DSQuantik
NeedsCompilation no
Repository CRAN
Date/Publication 2013-12-31 16:17:04

Include our badge in your README