- object
A GSNData object.
- pathways_data
An (optional) data.frame containing the results of GSEA analysis. (Either this or the
filename
argument must be set.
- filename
An (optional) filename for data sets read from a text file containing GSEA results. This is ignored
if the pathways_data
argument is set.
- id_col
(optional) A character vector of length 1 indicating the name of the column used as a key for gene
sets or modules. This is normally the NAME
field of GSEA data, which must be the same as the names of gene
sets specified in the tmod object or in the list of gene set vectors specified with the geneSetCollection
argument used when building the gene set network. By default this value is 'NAME'
. The IDs must correspond
to the names of the gene sets provided, or an error will be thrown. NOTE: In the tmod::tmodImportMSigDB function
provided by the tmod package, the default ID is an MSigDB accession, but GSEA data sets do not use this accession.
The NAME
column used in GSEA results set corresponds instead to the STANDARD_NAME
field in the MSigDB
XML database file. This STANDARD_NAME
field is not preserved by the standard tmod::tmodImportMSigDB
utility function, but instead reformatted converting underscores to spaces and non-initial letters to lower case.
Therefore, when using GSEA data sets with an MSigDB gene set collection imported using tmod::tmodImportMSigDB
the NAME
fields need to be mapped to the ID
or vice versa.
- stat_col
(optional) A character vector of length 1 indicating the name of the column used as a statistic
to evaluate the quality of pathways results. The function scans through possible stat_col
values
("FDR q-val"
, "FDR.q.val"
, "FWER p-val"
, "FWER.p.val"
, "NOM p-val"
,
"NOM.p.val"
), and uses the first one it finds. (The presence of spaces and hyphens in the column names
necessitates flexibility here. Depending on how GSEA results sets are read in, spaces and hyphens may be substituted
with periods.)
- sig_order
(optional) Either 'loToHi'
(default) or 'hiToLo'
depending on the statistic used to
evaluate pathways results.
- n_col
(optional) Specifies the column containing the number of genes in the gene set. Generally, this is the number
of genes in the gene set that are attested in an expression data set (Defaults to 'SIZE').
- sep
A separator for text file import, defaults to "\t". Ignored if filename
is not specified.