Learn R Programming

ScISI (version 1.44.0)

getLocOrfs: A function to obtain ORFs for the ScISI

Description

This function takes a list of bi-partite graph incidence matrices as well as a vector of GO nodes, and for each protein complex of each bi-partite graph, finds those constituent member proteins which are annotated to each GO node.

Usage

getLocOrfs(imList, goNode, pathToSave = NULL, name = NULL)

Arguments

imList
A list of bi-partite graph incidence matrices
goNode
A character vector of GO ID's
pathToSave
A character denoting where to save the resultant data
name
The name by which the resulting R-object as well as the file will be named

Value

The return value is a list of lists of lists:1. The top level list contains entries named by each GO ID 2. The second tier level contains entries after each entry of imList 3. The third tier level contains three entries: i. A bi-partite graph incidence matrix which only has protein complexes restricted to the respective GO ID annotation ii. A bi-partite graph incidence matrix which only has protein complexes listed in (i) but contains all the original proteins iii. A vector of proportions. We look at the column sums of (i) divided by the column sums fo (ii). This gives the ratio of proteins which are annotated by the GO ID over all other proteins in that complex
restrictedOrfsComp
A bi-partite graph incidence matrix which only has protein complexes restricted to the respective GO ID annotation
restrictedOrfsOnly
A bi-partite graph incidence matrix which only has protein complexes listed in (i) but contains all the original proteins
ratio
A vector of proportions. We look at the column sums of (i) divided by the column sums fo (ii). This gives the ratio of proteins which are annotated by the GO ID over all other proteins in that complex

Details

Currently, this function will take a protein complex and find those constituent protein members which have also been annotated to the user specified GO ID. For instance, the GO ID "" is the nucleus ID. This function will find all proteins within an interactome which has been annotated with the ID "". NB - if n GO ID is supplied, the function returns n different interactomes, each is has proteins uniquely annotated with each GO ID.

References

~put references to the literature/web site here ~