Learn R Programming

⚠️There's a newer version (0.4.1) of this package.Take me there.

driveR: An R Package for Prioritizing Cancer Driver Genes Using Genomics Data

Cancer genomes contain large numbers of somatic alterations but few genes drive tumor development. Identifying cancer driver genes is critical for precision oncology. Most of current approaches either identify driver genes based on mutational recurrence or using estimated scores predicting the functional consequences of mutations.

driveR is a tool for personalized or batch analysis of genomics data for driver gene prioritization by combining genomics information and prior biological knowledge. As features, driveR uses coding impact metaprediction scores, non-coding impact scores, somatic copy number alteration scores, hotspot gene/double-hit gene condition, ‘phenolyzer’ gene scores and memberships to cancer-related KEGG pathways. It uses these features to estimate cancer-type-specific probabilities for each gene of being a cancer driver using the related task of a multi-task learning classification model.

The method is described in detail in Ülgen E, Sezerman OU. driveR: a novel method for prioritizing cancer driver genes using somatic genomics data. BMC Bioinformatics. 2021 May 24;22(1):263.https://doi.org/10.1186/s12859-021-04203-7

Installation

You can install the latest released version of driveR from CRAN via:

install.packages("driveR")

You can install the development version of driveR from GitHub with:

# install.packages("devtools")
devtools::install_github("egeulgen/driveR", build_vignettes = TRUE)

Usage

driveR has two main objectives:

  1. Prediction of impact of coding variants (achieved via predict_coding_impact())
  2. Prioritization of cancer driver genes (achieved via create_features_df() and prioritize_driver_genes())

Note that driveR require operations outside of R and depends on the outputs from the external tools ANNOVAR and phenolyzer.

For detailed information on how to use driveR, please see the vignette “How to use driveR” via vignette("how_to_use")

Copy Link

Version

Install

install.packages('driveR')

Monthly Downloads

196

Version

0.3.0

License

MIT + file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

Ege Ulgen

Last Published

January 18th, 2022

Functions in driveR (0.3.0)

create_features_df

Create Data Frame of Features for Driver Gene Prioritization
create_noncoding_impact_score_df

Create Non-coding Impact Score Data Frame
determine_hotspot_genes

Determine Hotspot Containing Genes
specific_thresholds

Tumor type specific probability thresholds
prioritize_driver_genes

Prioritize Cancer Driver Genes
driveR

driveR: An R Package for Prioritizing Cancer Driver Genes Using Genomics Data
MTL_submodel_descriptions

MTL Sub-model Descriptions
example_cohort_scna_table

Example Cohort-level Somatic Copy Number Alteration Table
MCR_table

Table of Pan-Cancer Minimal Common Regions
example_cohort_features_table

Example Cohort-level Features Table for Driver Prioritization
KEGG_cancer_pathways

KEGG "Pathways in cancer"-related Pathways - Gene Sets
determine_double_hit_genes

Determine Double-Hit Genes
predict_coding_impact

Create Coding Impact Meta-prediction Score Data Frame
metapredictor_model

Random Forest Model for Coding Impact Metaprediction
KEGG_cancer_pathways_descriptions

KEGG "Pathways in cancer"-related Pathways - Descriptions
create_gene_level_scna_df

Create Gene-level SCNA Data Frame
TCGA_MTL_fit

Multi-Task Learning Model for Predicting Cancer Driver Genes
create_SCNA_score_df

Create SCNA Score Data Frame
example_scna_table

Example Somatic Copy Number Alteration Table
example_features_table

Example Features Table for Driver Prioritization