DGEList-class: Digital Gene Expression data - class
Description
A list-based S4 class for storing read counts and associated information from digital gene expression or sequencing technologies.
List Components
For objects of this class, rows correspond to genomic features and columns to samples.
The genomic features are called genes, but in reality might correspond to transcripts, tags, exons etc.
Objects of this class contain the following essential list components:
counts:- numeric matrix of read counts, one row for each gene and one column for each sample.
samples:- data.frame with a row for each sample and columns
group, lib.size and norm.factors containing the group labels, library sizes and normalization factors.
Other columns can be optionally added to give more detailed sample information.
Optional components include:
genes:- data.frame giving annotation information for each gene. Same number of rows as
counts. AveLogCPM:- numeric vector giving average log2 counts per million for each gene.
common.dispersion:- numeric scalar giving the overall dispersion estimate.
trended.dispersion:- numeric vector giving trended dispersion estimates for each gene.
tagwise.dispersion:- numeric vector giving tagwise dispersion estimates for each gene (note that `tag' and `gene' are synonymous here).
offset:- numeric matrix of same size as
counts giving offsets for use in log-linear models.
Methods
This class inherits directly from class list, so DGEList objects can be manipulated as if they were ordinary lists.
However they can also be treated as if they were matrices for the purposes of subsetting. The dimensions, row names and column names of a DGEList object are defined by those of counts, see dim.DGEList or dimnames.DGEList. DGEList objects can be subsetted, see subsetting. DGEList objects also have a show method so that printing produces a compact summary of their contents.