Learn R Programming

pathifier (version 1.10.0)

quantify_pathways_deregulation: Quantify deregulation of pathways in cancer

Description

Pathifier is an algorithm that infers pathway deregulation scores for each tumor sample on the basis of expression data. This score is determined, in a context-specific manner, for every particular dataset and type of cancer that is being investigated. The algorithm transforms gene-level information into pathway-level information, generating a compact and biologically relevant representation of each sample.

Usage

quantify_pathways_deregulation(data, allgenes, syms, pathwaynames, normals = NULL, 
ranks = NULL, attempts = 100, maximize_stability = TRUE, logfile = "", samplings = NULL,
min_exp = 4, min_std = 0.4)

Arguments

data
The n x m mRNA expression matrix, where n is the number of genes and m the number of samples.
allgenes
A list of n identifiers of genes.
syms
A list of p pathways, each pathway is a list of the genes it contains (as appear in "allgenes").
pathwaynames
The names of the p pathways.
normals
A list of m logicals, true if a normal sample, false if tumor.
ranks
External knowledge on the ranking of the m samples, if exists (to use initial guess)
attempts
Number of runs to determine stability.
maximize_stability
If true, throw away components leading to low stability of sampling noise.
logfile
Name of the file the log should be written to (use stdout if empty).
samplings
A matrix specifying the samples that should be chosen in each sampling attempt, chooses a random matrix if samplings is NULL.
min_exp
The minimal expression considered as a real signal. Any values below are thresholded to be min_exp.
min_std
The minimal allowed standard deviation of each gene. Genes with lower standard deviation are divided by min_std instead of their actual standard deviation. (Recommended: set min_std to be the technical noise).

Value

  • scoresThe deregulation scores, the main output of pathifier
  • genesinpathwayThe genes of each pathway used to devise its dregulation score
  • newmeanstdAverage standart devaition after omitting noisy components
  • origmeanstdOriginial average standart devaition, before omitting noisy components
  • pathwaysizeThe number of components used to devise the pathway score
  • curvesThe prinicipal curve learned for every pathway
  • curves_orderThe order of the points of the prinicipal curve learned for every pathway
  • zZ-scores of the expression matrix used to learn prinicpal curve
  • compinThe components not omitted due to noise
  • xmThe average expression over all normal samples
  • xsThe standart devation of expression over all normal samples
  • centerThe centering used by the PCA
  • rotThe matrix of variable loadings of the PCA
  • pctakenThe number of principal components used
  • samplingsA matrix specifying the samples that should be chosen in each sampling attempt
  • sucessPathways for which a deregulation score was sucessfully computed
  • logfileName of the file the log was written to

References

Drier Y, Sheffer M, Domany E. Pathway-based personalized analysis of cancer. Proceedings of the National Academy of Sciences, 2013, vol. 110(16) pp:6388-6393. (www.pnas.org/cgi/doi/10.1073/pnas.1219651110)

See more information on : http://www.weizmann.ac.il/pathifier/

Examples

Run this code
data(KEGG) # Two pathways of the KEGG database 
data(Sheffer) # The colorectal data of Sheffer et al.
PDS<-quantify_pathways_deregulation(sheffer$data, sheffer$allgenes,
  kegg$gs, kegg$pathwaynames, sheffer$normals, attempts = 100,
  logfile="sheffer.kegg.log", min_exp=sheffer$minexp, min_std=sheffer$minstd)

Run the code above in your browser using DataLab