Learn R Programming

⚠️There's a newer version (2.2.10) of this package.Take me there.

emil (version 2.2.6)

Evaluation of Modeling without Information Leakage

Description

A toolbox for designing and evaluating predictive models with resampling methods. The aim of this package is to provide a simple and efficient general framework for working with any type of prediction problem, be it classification, regression or survival analysis, that is easy to extend and adapt to your specific setting. Some commonly used methods for classification, regression and survival analysis are included.

Copy Link

Version

Install

install.packages('emil')

Monthly Downloads

1

Version

2.2.6

License

GPL (>= 2)

Issues

Pull Requests

Stars

Forks

Maintainer

Christofer Backlin

Last Published

June 21st, 2016

Functions in emil (2.2.6)

dichotomize

Dichotomize time-to-event data
emil

Introduction to the emil package
as.modeling_procedure

Coerce to modeling procedure
extension

Extending the emil framework with user-defined methods
fit_cforest

Fit conditional inference forest
error_fun

Performance estimation functions
fill

Replace values with something else
factor_to_logical

Convert factors to logicals
fit_caret

Fit a model using the caret package
evaluate

Evaluate a modeling procedure
fit_naive_bayes

Fit a naive Bayes classifier
fit_pamr

Fit nearest shrunken centroids model.
fit_randomForest

Fit random forest.
fit_qda

Fit quadratic discriminant.
fit_svm

Fit a support vector machine
fit_glmnet

Fit elastic net, LASSO or ridge regression model
fit_rpart

Fit a decision tree
fit_coxph

Fit Cox proportional hazards model
fit_lm

Fit a linear model fitted with ordinary least squares
fit_lda

Fit linear discriminant
fit

Fit a model
get_response

Extract the response from a data set
image.resample

Visualize resampling scheme
get_performance

Extract prediction performance
get_importance

Feature (variable) importance of a fitted model
get_tuning

Extract parameter tuning statistics
get_prediction

Extract predictions from modeling results
importance_glmnet

Feature importance extractor for elastic net models
importance_pamr

Feature importance of nearest shrunken centroids.
get_color

Get color palettes
list_method

List all available methods
index_fit

Convert a fold to row indexes of fittdng or test set
indent

Increase indentation
log_message

Print a timestamped and indented log message
impute

Regular imputation
is_constant

Check if an object contains more than one unique value
importance_randomForest

Feature importance of random forest.
is_multi_procedure

Detect if modeling results contains multiple procedures
learning_curve

Learning curve analysis
is_blank

Wrapper for several methods to test if a variable is empty
na_index

Support function for identifying missing values
name_procedure

Get names for modeling procedures
neg_gmpa

Negative geometric mean of class specific predictive accuracy
modeling_procedure

Setup a modeling procedure
mode

Get the most common value
nice_axis

Plots an axis the way an axis should be plotted.
%>%

Pipe operator
notify_once

Print a warning message if not printed earlier
nice_require

Load a package and offer to install if missing
nice_box

Plots a box around a plot
pre_factor_to_logical

Convert factors to logical columns
pre_process

Data preprocessing
pre_impute_df

Impute a data frame
predict_cforest

Predict with conditional inference forest
predict_caret

Predict using a caret method
roc_curve

Calculate ROC curves
resample

Resampling schemes
predict_coxph

Predict using Cox proportional hazards model
vlines

Add vertical or horizontal lines to a plot
weighted_error_rate

Weighted error rate
plot.learning_curve

Plot results from learning curve analysis
predict_lm

Prediction using linear model
plot.Surv

Plot Surv vector
pvalue.crr

Extracts p-value from a competing risk model
predict_naive_bayes

Predict using naive Bayes model
pvalue.cuminc

Extract p-value from a cumulative incidence estimation
pvalue.survdiff

Extracts p-value from a logrank test
pvalue

Extraction of p-value from a statistical test
select

emil and dplyr integration
subresample

Generate resampling subschemes
predict_randomForest

Prediction using random forest.
pre_log_message

Print log message during pre-processing
predict_rpart

Predict using a fitted decision tree
pre_pamr

PAMR adapted dataset pre-processing
pvalue.coxph

Extract p-value from a Cox proportional hazards model
print.preprocessed_data

Print method for pre-processed data
predict_lda

Prediction using already trained prediction model
predict_glmnet

Predict using generalized linear model with elastic net regularization
subtree

Extract a subset of a tree of nested lists
trivial_error_rate

Calculate the trivial error rate
pre_impute_knn

Nearest neighbors imputation
predict_pamr

Prediction using nearest shrunken centroids.
pre_impute

Basic imputation
predict_svm

Predict using support vector machine
predict_qda

Prediction using already trained classifier.
validate_data

Validate a pre-processed data set
tune

Tune parameters of modeling procedures
predict.model

Predict the response of unknown observations
%>%

Pipe operator