Learn R Programming

Multiple Imputation for Causal Graph Discovery (micd)

Add-on to the R package pcalg for handling missing data in contrataint-based causal graph discovery. Supports continuous, discrete and mixed data. Two options are available: 1) gaussCItwd, disCItwd and mixedCItwd perform test-wise deletion, where missing observations are deleted as necessary on a test-by-test basis; 2) gaussMItest, disMItest and mixedMItest perform conditional independence tests on multiply imputed data. Each of these functions can be used as a plug-in to pcalg::skeleton, pcalg::pc or pcalg::fci.

Additionally, the package contains all functions required for replicating the analyses in Foraita et al. (2020).

Installation

Install the packages graph and RBGL from Bioconductor. Make sure that rtools is installed on your computer. Then, simply type in R to install micd

devtools::install_github("bips-hb/micd")

(Note: The latest micd was created using R 4.2.1)

References

Foraita R, Friemel J, Günther K, Behrens T, Bullerdiek J, Nimzyk R, Ahrens W, Didelez V (2020). Causal discovery of gene regulation with incomplete data. Journal of the Royal Statistical Society: Series A (Statistics in Society), 183(4), 1747-1775. URL https://rss.onlinelibrary.wiley.com/doi/10.1111/rssa.12565.

Foraita R, Witte J, Börnhorst C, Gwozdz W, Pala V, Lissner L, Lauria F, Reisch L, Molnár D, De Henauw S, Moreno L, Veidebaum T, Tornaritis M, Pigeot I, Didelez V. A longitudinal causal graph analysis investigating modifiable risk factors and obesity in a European cohort of children and adolescents. 2022; medRxiv 2022.05.18.22275036.

Witte J, Foraita R, Didelez V (2022). Multiple imputation and test-wise deletion for causal discovery with incomplete cohort data. Statistics in Medicine, https://doi.org/10.1002/sim.9535.

References for pcalg

Markus Kalisch, Martin Mächler, Diego Colombo, Marloes H. Maathuis, Peter Bühlmann (2012). Causal Inference Using Graphical Models with the R Package pcalg. Journal of Statistical Software, 47(11), 1-26. URL www.jstatsoft.org/v47/i11/.

Alain Hauser, Peter Bühlmann (2012). Characterization and greedy learning of interventional Markov equivalence classes of directed acyclic graphs. Journal of Machine Learning Research, 13, 2409-2464. URL https://www.jmlr.org/papers/v13/hauser12a.html.

Copy Link

Version

Install

install.packages('micd')

Monthly Downloads

251

Version

1.1.1

License

GPL (>= 3)

Issues

Pull Requests

Stars

Forks

Maintainer

Ronja Foraita

Last Published

February 17th, 2023

Functions in micd (1.1.1)

mixMItest

Likelihood Ratio Test for (Conditional) Independence between Mixed Variables after Multiple Imputation
skeletonMI

Estimate (Initial) Skeleton of a DAG using the PC Algorithm for Multiple Imputed Data Sets of Continuous Data
with_graph

Evaluate Causal Graph Discovery Algorithm in Multiple Imputed Data sets
mixCItwd

Likelihood Ratio Test for (Conditional) Independence between Mixed Variables with Missings
mixCItest

Likelihood Ratio Test for (Conditional) Independence between Mixed Variables
pcMI

Estimate the Equivalence Class of a DAG Using the PC-MI Algorithm for Multiple Imputed Data Sets
make.formulas.saturated

Creates a formulas Argument
makeResiduals

Generate residuals based on variables in imputed data sets
flexCItest

Wrapper for gaussCItest, disCItest and mixCItest
gaussMItest

Test Conditional Independence of Gaussians via Fisher's Z Using Multiple Imputations
disMItest

G square Test for (Conditional) Independence between Discrete Variables after Multiple Imputation
gaussCItwd

Fisher's z-Test for (Conditional) Independence between Gaussian Variables with Missings
boot.graph

Bootstrap Resampling for the PC-MI- and the FCI-MI-algorithm
disCItwd

G square Test for (Conditional) Independence between Discrete Variables with Missings
flexMItest

Wrapper for gaussMItest, disMItest and mixMItest
getSuff

Obtain 'suffStat' for conditional independence testing
flexCItwd

Wrapper for gaussCItwd, disCItwd and mixCItwd
fciMI

Estimate a PAG by the FCI-MI Algorithm for Multiple Imputed Data Sets of Continuous Data