Learn R Programming

SAMBA

Health research using data from electronic health records (EHR) has gained popularity, but misclassification of EHR-derived disease status and lack of representativeness of the study sample can result in substantial bias in effect estimates and can impact power and type I error for association tests. Here, the assumed target of inference is the relationship between binary disease status and predictors modeled using a logistic regression model. SAMBA implements several methods for obtaining bias-corrected point estimates along with valid standard errors as proposed in Beesley and Mukherjee (2020), currently under review.

Installation

SAMBA can be downloaded from Github via the R Package devtools

devtools::install_github("umich-cphds/SAMBA", build_opts = c())

Vignette

Once you have SAMBA installed, you can type

vignette("UsingSAMBA")

in R to bring up a tutorial on SAMBA and how to use it.

Questions

For questions and comments about the implementation, please contact Alexander Rix (alexrix@umich.edu). For questions about the method, contact Lauren Beesley (lbeesley@umich.edu).

Reference

Statistical inference for association studies using electronic health records: handling both selection bias and outcome misclassification Lauren J Beesley, Bhramar Mukherjee medRxiv 2019.12.26.19015859

Copy Link

Version

Install

install.packages('SAMBA')

Monthly Downloads

193

Version

0.9.0

License

GPL-3

Maintainer

Alexander Rix

Last Published

February 20th, 2020

Functions in SAMBA (0.9.0)

approxdist

Estimate parameters in the disease model approximating the observed data distribution
nonlogistic

Estimate parameters in the disease model given sensitivity as a function of covariates.
obsloglik

Estimate parameters in the disease model using observed data log-likelihood using direct maximization.
obsloglikEM

Estimate parameters in the disease model using observed data log-likelihood using the expectation-maximization algorithm
sensitivity

Estimate sensitivity
samba.df

Synthetic example data for SAMBA adapted from the vignette