Learn R Programming

healthcareai (version 1.2.4)

healthcareai: healthcareai: a streamlined way to develop and deploy models

Description

healthcareai provides a clean interface to create and compare multiple models on your data and then deploy the model that is most accurate. healthcareai also includes functions for data exploration, data cleaning, and model evaluation.

Arguments

Details

This is done in a three-step process: First, loading, profiling, and feature engineering. Second, developing a model. Third, deploying and monitoring the model.

  1. Load and profile data

  2. Develop a machine learning model

    • Models:

      • LassoDevelopment: Used for regression or classification and does an especially good job with a lot of variables.

      • RandomForestDevelopment: Used for regression or classification and is well suited to non-linear data.

      • XGBoostDevelopment: Used for multi-class classification (problems where there are more than 2 classes). Well suited to non-linear data.

      • LinearMixedModelDevelopment: Best suited for longitudinal data and datasets with less than 100k rows and 50 variables. Can do classification or regression.

    • Performance of Trained Models:

      • Area under the ROC curve or area under the precision-recall curve are used to evaluate the performance of classification models.

      • The mean squared error (MSE) and root mean squared error (RMSE) are used to evaluate the performance of regression problems.

    Note: models are saved in the working directory after creation.

  3. Deploy and Monitor the Machine Learning Model

    • Deploy the Model:

    • Monitoring the model:

      • generateAUC is used to monitor performance over time. This should happen after the predictions can be validated with the result. If you're predicting 30-day readmissions, you can't validate until 30 days have passed since the predictions.

References

http://healthcareai-r.readthedocs.io

http://healthcare.ai