length=ncol(M). The AGSs for each of the Nsamples are created with one of the five available methods (see parameter method).
samples2ags(m0, Ntop = NA, col.mask = NA, namesFromColumn = NA, method = c("significant", "top", "toppos", "topnorm", "toprandom"), Lowercase = 1, cutoff.q = 0.05)i deviate from the mean over all the samples, requiring q-value (Benjamini-Hochberg FDR) be below cutoff.q.
(x[i] - mean(x)) / sd(x) but does not evaluate significance. Instead, top N ranked genes Ntop are taken into AGS.
x[i] - mean(x) is not divided with sd(x). This might help to prioritize genes with higher mean(x) and ignore ones with low signal. Consider also that AGSs from "top" overlap much more with each other than those from "topnorm", i.e. would be less sample-specific.
x[i] - mean(x). This might be useful when the gene expression values are small counts (such as in sincle-cell RNA sequencing), so that considering the left part of the distribution would not bring high-quality AGS.
data("fantom5.43samples", package="NEArender")
ags.list <- samples2ags(fantom5.43samples, cutoff.q = 0.01, method="significant")
Run the code above in your browser using DataLab