- df
data frame for imputation.
- m
number of sets produced by mice.
- maxit
maximum number of iteration for mice.
- col_miss
name of columns with missing values.
- col_no_miss
character vector. Names of columns without NA.
- col_type
character vector. Vector containing column type names.
- set_cor
Correlation or fraction of featurs using if optimize= False
- set_method
Method used if optimize=False. If NULL default method is used (more in methods_random section ).
- percent_of_missing
numeric vector. Vector contatining percent of missing data in columns for example c(0,1,0,0,11.3,..)
- low_corr
double betwen 0,1 default 0 lower boundry of correlation set.
- up_corr
double between 0,1 default 1 upper boundary of correlation set. Both of these parameters work the same for a fraction of features.
- methods_random
set of methods to chose. Default 'pmm'. If seted on NULL this methods are used predictive mean matching (numeric data) logreg, logistic regression imputation (binary data, factor with 2 levels) polyreg, polytomous regression imputation for unordered categorical data (factor > 2 levels) polr, proportional odds model for (ordered, > 2 levels).
- iter
number of iteration for randomSearch.
- random.seed
random seed.
- optimize
if user wont to optimize.
- correlation
If True correlation is using if Fales fraction of features. Default True.
- return_one
One or many imputed sets will be returned. Default True.
- col_0_1
Decaid if add bonus column informing where imputation been done. 0 - value was in dataset, 1 - value was imputed. Default False. (Works only for returning one dataset).
- verbose
If FALSE function didn't print on console.
- out_file
Output log file location if file already exists log message will be added. If NULL no log will be produced.