Benjamin HaibeKains

Benjamin HaibeKains

6 packages on CRAN

4 packages on Bioconductor

CREAM

cran
99.99th

Percentile

Provides a new method for identification of clusters of genomic regions within chromosomes. Primarily, it is used for calling clusters of cis-regulatory elements (COREs). 'CREAM' uses genome-wide maps of genomic regions in the tissue or cell type of interest, such as those generated from chromatin-based assays including DNaseI, ATAC or ChIP-Seq. 'CREAM' considers proximity of the elements within chromosomes of a given sample to identify COREs in the following steps: 1) It identifies window size or the maximum allowed distance between the elements within each CORE, 2) It identifies number of elements which should be clustered as a CORE, 3) It calls COREs, 4) It filters the COREs with lowest order which does not pass the threshold considered in the approach.

genefu

bioconductor
99.99th

Percentile

This package contains functions implementing various tasks usually required by gene expression analysis, especially in breast cancer studies: gene mapping between different microarray platforms, identification of molecular subtypes, implementation of published gene signatures, gene selection, and survival analysis.

MM2S

cran
99.99th

Percentile

A single-sample classifier that generates Medulloblastoma (MB) subtype predictions for single-samples of human Medulloblastoma (MB) patients and model systems, including cell lines and mouse-models. The MM2S algorithm uses a systems-based methodology that facilitates application of the algorithm on samples irrespective of their platform or source of origin. MM2S demonstrates > 96% accuracy for patients of well-characterized normal cerebellum, Wingless (WNT), or Sonic hedgehog (SHH) subtypes, and the less-characterized Group4 (86%) and Group3 (78.2%). MM2S also enables classification of MB cell lines and mouse models into their human counterparts.This package contains function for implementing the classifier onto human data and mouse data, as well as graphical rendering of the results as PCA (Principal Component Analysis) plots and heatmaps. Deena Gendoo and Benjamin Haibe-Kains (2016) <doi:10.1186/s13029-016-0053-y>.

MM2Sdata

cran
99.99th

Percentile

Gene Expression datasets for the 'MM2S' package. Contains normalized expression data for Human Medulloblastoma ('GSE37418') as well as Mouse Medulloblastoma models ('GSE36594'). Deena Gendoo et al. (2015) <doi:10.1016/j.ygeno.2015.05.002>.

mRMRe

cran
99.99th

Percentile

"Computes mutual information matrices from continuous, categorical and survival variables, as well as feature selection with minimum redundancy, maximum relevance (mRMR) and a new ensemble mRMR technique with DOI: N De Jay et al. (2013) <doi:10.1093/bioinformatics/btt383>."

PharmacoGx

cran
99.99th

Percentile

Contains a set of functions to perform large-scale analysis of pharmacogenomic data.

SIGN

cran
99.99th

Percentile

Provides a classification framework to use expression patterns of pathways as features to identify similarity between biological samples. It provides a new measure for quantifying similarity between expression patterns of pathways.

predictionet

bioconductor
99.99th

Percentile

This package contains a set of functions related to network inference combining genomic data and prior information extracted from biomedical literature and structured biological databases. The main function is able to generate networks using Bayesian or regression-based inference methods; while the former is limited to < 100 of variables, the latter may infer networks with hundreds of variables. Several statistics at the edge and node levels have been implemented (edge stability, predictive ability of each node, ...) in order to help the user to focus on high quality subnetworks. Ultimately, this package is used in the 'Predictive Networks' web application developed by the Dana-Farber Cancer Institute in collaboration with Entagen.

survcomp

bioconductor
99.99th

Percentile

R package providing functions to assess and to compare the performance of risk prediction (survival) models.

saps

bioconductor
99.99th

Percentile

Functions implementing the Significance Analysis of Prognostic Signatures method (SAPS). SAPS provides a robust method for identifying biologically significant gene sets associated with patient survival. Three basic statistics are computed. First, patients are clustered into two survival groups based on differential expression of a candidate gene set. P_pure is calculated as the probability of no survival difference between the two groups. Next, the same procedure is applied to randomly generated gene sets, and P_random is calculated as the proportion achieving a P_pure as significant as the candidate gene set. Finally, a pre-ranked Gene Set Enrichment Analysis (GSEA) is performed by ranking all genes by concordance index, and P_enrich is computed to indicate the degree to which the candidate gene set is enriched for genes with univariate prognostic significance. A SAPS_score is calculated to summarize the three statistics, and optionally a Q-value is computed to estimate the significance of the SAPS_score by calculating SAPS_scores for random gene sets.