GEOsearch (version 1.2.2)

KeyWordFreq: KeyWordFreq

Description

Frequencies of common biology keywords appearing in search results

Usage

KeyWordFreq(searchtable, category = c("celltype", "disease", "tissue"))

Arguments

searchtable
The direct output of function "GEOsearchterm"
category
A character vector specifying which category in the common biology keyword list to be used. Should be contain "celltype", "disease" or "tissue".

Value

A data.frame with the frequency of each common biology keyword. First column: keyword name; Second column: kyeword frequency; Third column: log fold change of the frequency; Fourth column: FDR of fisher test.

Details

This function calculates the frequencies of each common biology keyword appearing in the given search table. The list of common biology keywords is compiled from http://www.atcc.org/. The list contains three categories: cell types, diseases and tissues. Users can specify which category to be used. The function also returns log fold change and FDR of fisher test to check whether each keyword has significantly more appearance compared to base frequency. The base frequency is defined as the number of appearance of the key word in all samples (roughly 40000 samples) included in GEO database.

Examples

Run this code
KeyWordFreq(GEOSearchTerm("Oct4 RNA-seq"))

Run the code above in your browser using DataLab