BioInstaller

Introduction

The increase in bioinformatics resources such as tools/scripts and databases poses a great challenge for users seeking to construct interactive and reproducible biological data analysis applications.

R, one of the most popular programming languages for statistics, biological data analysis, and big data, offers a large ecosystem of free packages (>14000) for different types of applications. However, because R lacks a high-performance, open-source cloud platform (comparable to Galaxy for Python users), it is still difficult for R users, especially those without web development skills, to construct interactive and reproducible biological data analysis applications that support file upload and management, long-running computation, task submission, tracking of output files, exception handling, logging, export of plots and tables, and an extensible plugin system.

The collection, management, and sharing of bioinformatics tools/scripts and databases are also essential for almost all bioinformatics analysis projects.

Here, we established a new platform for constructing interactive and reproducible biological data analysis applications based on the R language. The platform offers several user interfaces, including R functions, an R Shiny application, and REST APIs, and supports collecting, managing, sharing, and using a large number of bioinformatics tools/scripts and databases.

Features:

  • Easy-to-use
  • User-friendly Shiny application
  • Integrative platform of databases and bioinformatics resources
  • Open source and completely free
  • One-click download and installation of bioinformatics resources (via R, Shiny, or OpenCPU REST APIs)
  • Extra attention to software and database resources that have not been included by other tools
  • Logging
  • System monitor
  • Task submitting system
  • Parallel tasks

Fields

  • Quality Control
  • Alignment And Assembly
  • Alternative Splicing
  • ChIP-seq analysis
  • Gene Expression Data Analysis
  • Variant Detection
  • Variant Annotation
  • Virus Related
  • Statistics and Visualization
  • Noncoding RNA Related Database
  • Cancer Genomics Database
  • Regulator Related Database
  • eQTL Related Database
  • Clinical Annotation
  • Drugs Database
  • Proteomic Database
  • Software Dependence Database
  • ......

Installation

CRAN

# You can install this package directly from CRAN by running (from within R):
install.packages('BioInstaller')

GitHub

# install.packages("devtools")
devtools::install_github("JhuangLab/BioInstaller")
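
Once the package is installed, a first session might look like the sketch below. It is a minimal illustration, not an official recipe: it assumes network access, uses 'samtools' purely as an example name, and relies on the `show.all.names`, `show.all.versions`, `download.dir`, and `download.only` arguments of `install.bioinfo` as recalled from the 0.3.7 documentation, so check `?install.bioinfo` before relying on them.

```r
# Minimal sketch: browse the catalogue and download one tool.
# Assumes BioInstaller is installed and the machine has network access.
library(BioInstaller)

# Ask for the names of every tool and database BioInstaller knows about.
all_names <- install.bioinfo(show.all.names = TRUE)
head(all_names)

# Ask for the versions recorded for one tool ('samtools' is illustrative).
install.bioinfo("samtools", show.all.versions = TRUE)

# Download into a scratch directory without building or installing.
download.dir <- file.path(tempdir(), "samtools")
install.bioinfo("samtools", download.dir = download.dir, download.only = TRUE)
```

Using a directory under `tempdir()` keeps the trial download out of any permanent tool location; point `download.dir` at a persistent path once the call behaves as expected.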

Shiny UI overview

Note: the Shiny application of BioInstaller has been migrated to the bioshiny project. All Shiny files in this package will be deprecated soon.

In the new project, we will provide a one-click method (command line and Shiny UI) to create a bioshiny application and its plugins. We are also developing more free bioshiny plugins for various kinds of bioinformatics data analysis.

# Start the standalone Shiny application
BioInstaller::web(auto_create = TRUE)

Contributed Resources

Support Summary

Quality Control:

  • FastQC, PRINSEQ, SolexaQA, FASTX-Toolkit ...

Alignment and Assembly:

  • BWA, STAR, TMAP, Bowtie, Bowtie2, tophat2, hisat2, GMAP-GSNAP, ABySS, SSAHA2, Velvet, Edena, Trinity, oases, RUM, MapSplice2, NovoAlign ...

Variant Detection:

  • GATK, Mutect, VarScan2, FreeBayes, LoFreq, TVC, SomaticSniper, Pindel, Delly, BreakDancer, FusionCatcher, Genome STRiP, CNVnator, CNVkit, SpeedSeq ...

Variant Annotation:

  • ANNOVAR, SnpEff, VEP, oncotator ...

Utils:

  • htslib, samtools, bcftools, bedtools, bamtools, vcftools, sratools, picard, HTSeq, seqtk, UCSC Utils(blat, liftOver), bamUtil, jvarkit, bcl2fastq2, fastq_tools ...

Genome:

  • hisat2_reffa, ucsc_reffa, ensemble_reffa ...

Others:

  • sparsehash, SQLite, pigz, lzo, lzop, bzip2, zlib, armadillo, pxz, ROOT, curl, xz, pcre, R, gatk_bundle, ImageJ, igraph ...

Databases:

  • ANNOVAR, blast, CSCD, GATK_Bundle, biosystems, civic, denovo_db, dgidb, diseaseenhancer, drugbank, ecodrug, expression_atlas, funcoup, gtex, hpo, inbiomap, interpro, medreaders, mndr, msdd, omim, pancanqtl, proteinatlas, remap2, rsnp3, seecancer, srnanalyzer, superdrug2, tumorfusions, varcards ...

Docker

BioInstaller can be used in Docker since v0.3.0. The Shiny application has been supported since v0.3.5.

docker pull bioinstaller/bioinstaller
docker run -it -p 80:80 -p 8004:8004 -v /tmp/download:/tmp/download bioinstaller/bioinstaller

Service list:

  • localhost/ocpu/: OpenCPU service
  • localhost/shiny/BioInstaller: Shiny service
  • localhost/rstudio/: RStudio Server (opencpu/opencpu)

Citation

  • Li J, Cui B, Dai Y, et al. BioInstaller: a comprehensive R package to construct interactive and reproducible biological data analysis applications based on the R platform[J]. PeerJ, 2018, 6:e5853.

How to contribute?

Please fork the GitHub BioInstaller repository, modify it, and submit a pull request to us. In particular, the file lists in the contributed section should be updated when you find a tool or database that is not included in other software repositories.

Maintainer

Jianfeng Li

License

R package:

MIT

Other related resources:

Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License

Monthly Downloads

95

Version

0.3.7

License

MIT + file LICENSE

Last Published

November 20th, 2018

Functions in BioInstaller (0.3.7)

get.meta.files

Get the paths of all BioInstaller meta files, such as database, GitHub source, non-GitHub source, and web source

conda.list

Wrapper function of 'conda list'; lists linked packages in a conda environment

docker.pull

Use Docker to pull an image

docker.search

Search for Docker information about software in the BioInstaller Docker database

spack

Wrapper function of spack

is.biosoftwares.db.active

Test active configuration file

new.bioinfo

Create new BioInstaller items in the GitHub forum

install.nongithub

Install or download software from non-GitHub websites

install.github

Install or download software from GitHub

show.installed

Show all installed bio-software on the system

spack.list

Wrapper function of 'spack list'; lists and searches available packages

install.bioinfo

Download and install biological software or databases

web

Function to run the BioInstaller Shiny app service

set.biosoftwares.db

Set BIO_SOFWARES_DB_ACTIVE as the BioInstaller db

get.info

Show information about biological software on the system

set_shiny_workers

Function to set Shiny workers for the background service

get.meta

Get meta information about the sources collected by BioInstaller, such as database, GitHub source, non-GitHub source, and web source

del.info

Delete information about biological software on the system

crawl.all.versions

A function that can be used to crawl all source code from the information stored in nongithub.cfg

BioInstaller

This package is a new platform for constructing interactive and reproducible biological data analysis applications based on the R language. It includes R functions, an R Shiny application, and REST APIs.

change.info

Update information about biological software on the system

conda.env.list

Wrapper function of 'conda env list'; lists the conda environments

conda

Wrapper function of conda

conda.env.create

Wrapper function of 'conda env create'; creates an environment based on an environment file

copy_plugins

Function to copy the default plugins of BioInstaller

copy_configs

Function to copy the default configuration file of BioInstaller