Learn R Programming

⚠️There's a newer version (1.6.2) of this package.Take me there.

systemPipeR: NGS workflow and report generation environment

systemPipeR is an R/Bioconductor package for building end-to-end analysis pipelines with automated report generation for next generation sequence (NGS) applications such as RNA-Seq, ChIP-Seq, VAR-Seq and many others. An important feature is support for running command-line software, such as NGS aligners, on both single machines or compute clusters. This includes both interactive job submissions or batch submissions to queuing systems of clusters. Efficient handling of complex sample sets and experimental designs is facilitated by a well-defined sample annotation infrastructure which improves reproducibility and user-friendliness of many typical analysis workflows in the NGS area.

Installation

To install the package, please use the biocLite method as instructed here.

Usage

Instructions for running systemPipeR are given in its vignette (manual). The sample data set used in the vignette can be downloaded here. The expected format to define NGS samples (e.g. FASTQ files) and their labels are given in targets.txt and targetsPE.txt (latter is for PE reads). The run parameters of command-line software are defined by param files that have a simplified JSON-like name/value structure. Here is a sample param file for Tophat2: tophat.param. Templates for setting up custom project reports are provided as *.Rnw files in the vignettes subdirectory of this package. The corresponding PDFs of these report templates are linked here: systemPipeRNAseq, systemPipeChIPseq and systemPipeVARseq.

Copy Link

Version

Version

1.2.23

License

Artistic-2.0

Issues

Pull Requests

Stars

Forks

Maintainer

Thomas Girke

Last Published

February 15th, 2017

Functions in systemPipeR (1.2.23)

catDB-class

Class "catDB"
filterDEGs

Filter and plot DEG results
alignStats

Alignment statistics
countRangeset

Read counting for several range sets
filterVars

Filter VCF files
INTERSECTset-class

Class "INTERSECTset"
GOHyperGAll

GO term enrichment analysis for large numbers of gene sets
getQsubargs

Arguments for qsub
clusterRun

Submit command-line tools to cluster
catmap

catDB accessor methods
overLapper

Set Intersect and Venn Diagram Functions
olBarplot

Bar plot for intersect sets
returnRPKM

RPKM Normalization
readComp

Import sample comparisons from targets file
run_DESeq2

Runs DESeq2
moduleload

Interface to module system
mergeBamByFactor

Merge BAM files based on factor
run_edgeR

Runs edgeR
preprocessReads

Run custom read preprocessing functions
qsubRun

Submit command-line tools to cluster
writeTargetsout

Write updated targets out to file
SYSargs-class

Class "SYSargs"
sysargs

SYSargs accessor methods
writeTargetsRef

Generate targets file with reference
vennPlot

Plot 2-5 way Venn diagrams
VENNset-class

Class "VENNset"
runDiff

Differential abundance analysis for many range sets
runCommandline

Execute SYSargs
variantReport

Generate Variant Report
systemArgs

Constructs SYSargs object from param and targets files
seeFastq

Quality reports for FASTQ files
symLink2bam

Symbolic links for IGV