Learn R Programming

Overview

Surrogate-guided ensemble Latent Dirichlet Allocation (sureLDA) is a label-free multidimensional phenotyping method. It first uses the PheNorm or MAP algorithm to initialize probabilities based on two surrogate features for each target disease, and then leverages these probabilities to guide the LDA topic model to generate phenotype-specific topics. Finally, it combines phenotype-feature counts with surrogates via clustering ensemble to yield final phenotype probabilities.

See Ahuja et al. JAMIA (2020) for details.

Installation

If devtools is not installed, uncomment the code below and install it from CRAN.

# install.packages("devtools")

Run the code below to install sureLDA from GitHub:

devtools::install_github("celehs/sureLDA")

Getting Started

Click HERE to view a demo with a simulated example.

References

Y. Ahuja, D. Zhou, Z. He, J. Sun, V. M. Castro, V. Gainer, S. N. Murphy, C. Hong, T. Cai. sureLDA: A Multi-Disease Automated Phenotyping Method for the Electronic Health Record. J Am Med Inform Assoc (2020); 27(8): 1235-1243

Copy Link

Version

Install

install.packages('sureLDA')

Monthly Downloads

149

Version

0.1.0-1

License

GPL-3

Issues

Pull Requests

Stars

Forks

Maintainer

Yuri Ahuja

Last Published

November 10th, 2020

Functions in sureLDA (0.1.0-1)

sureLDA-package

sureLDA: A Novel Multi-Disease Automated Phenotyping Method for the Electronic Health Record
simdata

Simulated Dataset
sureLDA

Surrogate-guided ensemble Latent Dirichlet Allocation