Learn R Programming

CoGAPS (version 2.6.0)

Coordinated Gene Activity in Pattern Sets

Description

Coordinated Gene Activity in Pattern Sets (CoGAPS) implements a Bayesian MCMC matrix factorization algorithm, GAPS, and links it to gene set statistic methods to infer biological process activity. It can be used to perform sparse matrix factorization on any data, and when this data represents biomolecules, to do gene set analysis.

Copy Link

Version

Version

2.6.0

License

GPL (==2)

Maintainer

Elana J Fertig

Last Published

February 15th, 2017

Functions in CoGAPS (2.6.0)

CoGAPS

CoGAPS calls the C++ MCMC code through gapsRun and performs Bayesian matrix factorization returning the two matrices that reconstruct the data matrix and then calls calcCoGAPSStat to estimate gene set activity with nPerm set to 500
gapsMapRun

gapsMapRun calls the C++ MCMC code and performs Bayesian matrix factorization returning the two matrices that reconstruct the data matrix; as opposed to gapsRun, this method takes an additional input specifying set patterns in the P matrix
SimpSim.D

Simulated data
plotGAPS

Plotter for GAPS decomposition results
SimpSim.A

Simulated data
calcCoGAPSStat

CoGAPS gene set statistic
plotAtoms

plotAtoms a simple plot of the number of atoms from one of the vectors returned with atom numbers
calcZ

calcZ calculates the Z-score for each element based on input mean and standard deviation matrices
GIST.D

Sample GIST gene expression data from Ochs et al. (2009).
residuals

residuals calculate residuals and produce heatmap
plotSmoothPatterns

Plot loess smoothed CoGAPS patterns
tf2ugFC

Gene sets defined by transcription factors defined from TRANSFAC.
gapsMapTestRun

gapsMapTestRun calls the C++ MCMC code and performs Bayesian matrix factorization returning the two matrices that reconstruct the data matrix; as opposed to gapsRun, this method takes an additional input specifying set patterns in the P matrix. Test procedures allow for the returning of the matrix and atomic information for A and P during each step of the equilibration and sampling.
plotP

plotP plots the P matrix in a line plot with error bars
plotDiag

plotDiag plots a series of diagnostic plots
GIST.S

Sample GIST gene expression data from Ochs et al. (2009).
CoGAPS-package

CoGAPS: Coordinated Gene Analysis Pattern Sets
binaryA

binaryA creates a binarized heatmap of the A matrix in which the value is 1 if the value in Amean is greater than threshold * Asd and 0 otherwise
SimpSim.S

Simulated data
gapsIntraPattern

gapsIntraPattern generates statistics for the similarity of gene expression vectors within a pattern
computeGeneGSProb

CoGAPS gene membership statistic
reorderByPatternMatch

Match two sets of patterns found with CoGAPS
gapsRun

gapsRun calls the C++ MCMC code and performs Bayesian matrix factorization returning the two matrices that reconstruct the data matrix
GSets

Simulated dataset to quantify gene set membership.
gapsInterPattern

gapsInterPattern calculates statistics for measuring the distance between patterns based on genes associated with the patterns
gapsTestRun

gapsTestRun calls the C++ MCMC code and performs Bayesian matrix factorization returning the two matrices that reconstruct the data matrix. Test procedures allow for the returning of the matrix and atomic information for A and P during each step of the equilibration and sampling. .
SimpSim.P

Simulated data