Learn R Programming

⚠️There's a newer version (1.6.2) of this package.Take me there.

systemPipeR: NGS workflow and report generation environment

systemPipeR is an R/Bioconductor package for building end-to-end analysis pipelines with automated report generation for next generation sequence (NGS) applications such as RNA-Seq, ChIP-Seq, VAR-Seq and many others. An important feature is support for running command-line software, such as NGS aligners, on both single machines or compute clusters. This includes both interactive job submissions or batch submissions to queuing systems of clusters. Efficient handling of complex sample sets and experimental designs is facilitated by a well-defined sample annotation infrastructure which improves reproducibility and user-friendliness of many typical analysis workflows in the NGS area.

Installation

To install the package, please use the biocLite method as instructed here.

Usage

Instructions for running systemPipeR are given in its vignette (manual). The sample data set used in the vignette can be downloaded here. The expected format to define NGS samples (e.g. FASTQ files) and their labels are given in targets.txt and targetsPE.txt (latter is for PE reads). The run parameters of command-line software are defined by param files that have a simplified JSON-like name/value structure. Here is a sample param file for Tophat2: tophat.param. Templates for setting up custom project reports are provided as *.Rnw files in the vignettes subdirectory of this package. The corresponding PDFs of these report templates are linked here: systemPipeRNAseq, systemPipeChIPseq and systemPipeVARseq.

Copy Link

Version

Version

1.0.12

License

Artistic-2.0

Issues

Pull Requests

Stars

Forks

Maintainer

Thomas Girke

Last Published

February 15th, 2017

Functions in systemPipeR (1.0.12)

INTERSECTset-class

Class "INTERSECTset"
catDB-class

Class "catDB"
clusterRun

Submit command-line tools to cluster
moduleload

Interface to module system
GOHyperGAll

GO term enrichment analysis for large numbers of gene sets
getQsubargs

Arguments for qsub
catmap

catDB accessor methods
alignStats

Alignment statistics
filterDEGs

Filter and plot DEG results
olBarplot

Bar plot for intersect sets
seeFastq

Quality reports for FASTQ files
run_DESeq2

Runs DESeq2
overLapper

Set Intersect and Venn Diagram Functions
symLink2bam

Symbolic links for IGV
qsubRun

Submit command-line tools to cluster
preprocessReads

Run custom read preprocessing functions
runCommandline

Execute SYSargs
run_edgeR

Runs edgeR
readComp

Import sample comparisons from targets file
returnRPKM

RPKM Normalization
VENNset-class

Class "VENNset"
sysargs

SYSargs accessor methods
writeTargetsout

Write updated targets out to file
SYSargs-class

Class "SYSargs"
vennPlot

Plot 2-5 way Venn diagrams
systemArgs

Constructs SYSargs object from param and targets files