dcEnrichment
is supposed to conduct enrichment
analysis for an input group of domains using a specified
ontology. It returns an object of S4 class "Eoutput".
Enrichment analysis is based on either Fisher's exact test
or Hypergeometric test. The test can respect the hierarchy
of the ontology.dcEnrichment(data, domain = c("SCOP.sf", "SCOP.fa"), ontology =
c("GOBP",
"GOMF", "GOCC", "DO", "HPPA", "HPMI", "HPON", "MP", "EC", "KW", "UP"),
sizeRange = c(10, 1000), min.overlap = 3, which_distance = NULL,
test = c("HypergeoTest", "FisherTest", "BinomialTest"),
p.adjust.method = c("BH", "BY", "bonferroni", "holm", "hochberg",
"hommel"),
ontology.algorithm = c("none", "pc", "elim", "lea"), elim.pvalue =
0.01,
lea.depth = 2, verbose = T,
RData.location = "http://supfam.org/dcGOR/data")
Eoutput
, with following
slots: domain
: a character
specifying the domain identityontology
: a
character specifying the ontology usedterm_info
: a matrix of nTerm X 5 containing
term information, where nTerm is the number of terms in
consideration, and the 5 columns are "term_id" (i.e. "Term
ID"), "term_name" (i.e. "Term Name"), "namespace" (i.e.
"Term Namespace"), "distance" (i.e. "Term Distance") and
"IC" (i.e. "Information Content for the term based on
annotation frequency by it")anno
: a list of
terms, each storing annotated domain members. Always, terms
are identified by "term_id" and domain members identified
by their ids (e.g. sunids for SCOP domains)data
: a vector containing input data in
consideration. It is not always the same as the input data
as only those mappable are retainedoverlap
:
a list of terms, each storing domains overlapped between
domains annotated by a term and domains in the input data
(i.e. the domains of interest). Always, terms are
identified by "term_id" and domain members identified by
their ids (e.g. sunids for SCOP domains)zscore
: a vector containing z-scorespvalue
: a vector containing p-valuesadjp
: a vector containing adjusted p-values.
It is the p value but after being adjusted for multiple
comparisonsdcRDataLoader
, dcDAGannotate
,
Eoutput-class
, visEnrichment
,
dcConverter
# 1) load SCOP.sf (as 'InfoDataFrame' object)
SCOP.sf <- dcRDataLoader('SCOP.sf')
# randomly select 20 domains
data <- sample(rowNames(SCOP.sf), 20)
# 2) perform enrichment analysis, producing an object of S4 class 'Eoutput'
eoutput <- dcEnrichment(data, domain="SCOP.sf", ontology="GOMF")
eoutput
# 3) view the top 10 significance terms
view(eoutput, top_num=10, sortBy="pvalue", details=TRUE)
# 4) visualise the top 10 significant terms in the ontology hierarchy
# color-coded according to 10-based negative logarithm of adjusted p-values (adjp)
visEnrichment(eoutput)
Run the code above in your browser using DataLab