Learn R Programming

geecc (version 1.4.0)

GO2list: Filter GO and KEGG database

Description

Filter GO and KEGG database and transform database to list

Usage

GO2list(dbase, go.cat = NULL, rm = NULL, keep = NULL) KEGG2list(dbase, rm = NULL, keep = NULL) GO2offspring(x) GO2level(x, go.level=-1, relation=c("is_a"))

Arguments

dbase
A datastructure storing identifieres of GO/KEGG terms and assigned genes. Can be one of
database
usually of class ‘ProbeGo3AnnDbBimap’ (as defined in package “AnnotationDbi”)

named list
with keys being the identifiers and values being genes

dataframe
with first column being the identifiers and second column being genes. Additional columns are ignored.

x
a list with keys being the identifiers and values being genes (e.g. output of GO2list)
go.cat
GO category ("MF", "BP", "CC") that should be returned and filtered
go.level
Level in the DAG of GO terms. Defaults to “-1” for pass through without modification. Otherwise: a positive integer giving the level at which GO terms should be grouped together.
rm
remove these terms
keep
keep only these terms
relation
relationships in GO hierarchy that should be considered. Defaults to “is_a”

Value

A named list with each slot containing the ids for the term or pathway.

Details

The settings for “rm” and “keep” can be combined, allowing for efficient reduction of the number of GO terms and KEGG pathways, respectively.

Providing a named list instead of a database can be useful for non-model organisms, where only a draft Blast2GO-annotation is available. In this case, the names of the list are the GO terms (or KEGG pathways) and the content of each list item is a character vector with tag-ids.

The function GO2offspring does the same as the databaseGO2ALLPROBES function does (e.g. hgu133plus2GO2ALLPROBES). I.e. instead of representing only features (probe sets, genes, ...) assigned to the GO terms directly, it also contains all features assigned to all children (offsprings).

The function GO2level groups GO terms together at a more general level to simplify data interpretation and speed up runtime. This function works according to the level option provided by DAVID, but the number of levels is not restricted.

Examples

Run this code
library(hgu133plus2.db)
x <- GO2list(dbase=hgu133plus2GO2PROBE, go.cat="CC",
	rm=c("GO:0000139", "GO:0000790", "GO:0005730", "GO:0005739"))

Run the code above in your browser using DataLab