Learn R Programming

⚠️There's a newer version (10.0.2) of this package.Take me there.

msigdbr: MSigDB Gene Sets for Multiple Organisms in a Tidy Data Format

Overview

The msigdbr R package provides Molecular Signatures Database (MSigDB) gene sets typically used with the Gene Set Enrichment Analysis (GSEA) software:

  • in an R-friendly tidy/long format with one gene per row
  • for multiple frequently studied model organisms, such as mouse, rat, pig, zebrafish, fly, and yeast, in addition to the original human genes
  • as gene symbols as well as NCBI Entrez and Ensembl IDs
  • that can be installed and loaded as a package without requiring additional external files

Installation

The package can be installed from CRAN.

install.packages("msigdbr")

Usage

The package data can be accessed using the msigdbr() function, which returns a data frame of gene sets and their member genes. For example, you can retrieve mouse genes from the C2 (curated) CGP (chemical and genetic perturbations) gene sets.

genesets = msigdbr(species = "Mus musculus", category = "C2", subcategory = "CGP")

Check the documentation website for more information.

Copy Link

Version

Install

install.packages('msigdbr')

Monthly Downloads

17,341

Version

7.5.1

License

MIT + file LICENSE

Maintainer

Igor Dolgalev

Last Published

March 30th, 2022

Functions in msigdbr (7.5.1)

%>%

Pipe operator
msigdbr

Retrieve the gene sets data frame
msigdbr_collections

List the collections available in the msigdbr package
msigdbr_species

List the species available in the msigdbr package
msigdbr_show_species

List the species available in the msigdbr package