simulateGEdata returns simulated noisy gene expression values of specified size
and its underlying gene-gene correlation.simulateGEdata(n, m, k, size.alpha, corr.strength, g = NULL,
Sigma.eps = 0.1, nc, ne, intercept = TRUE, check.input = FALSE)k, corr.strength)) giving the
correlation between $X$ and $W$ or NULL for independence.Sigma.eps $>0$.TRUE all input is checked
(not advisable for large simulations).simulateGEdata returns output of the class simulateGEdata.
An object of class simulateGEdata is a list with the
following components:
TruthYNoiseSigmaInfon genes in
m arrays. The expression values consist of true expression and noise:
$$Y=X\beta+W\alpha+\epsilon$$
The dimensions of the matrices $X$ and $\beta$ are used to control the size of
the correlation between the genes. It is possible to simualte three different classes
of genes:
nc genes in the data,
whereas the strongly expressed genes are always the first ne genes in the data.
The parameter intercept controls whether the systematic noise has an
offset or not. Note that the intercept is one dimension of $W$.
It is possible to either simulate data where $W$ and $X$ are independent by
setting g to NULL, or increasing correlation $bWX$ between
$W$ and $X$ by increasing g.Y<-simulateGEdata(500, 500, 10, 2, 5, g=NULL, Sigma.eps=0.1,
250, 100, intercept=TRUE, check.input=TRUE)
Y
Y<-simulateGEdata(500, 500, 10, 2, 5, g=3, Sigma.eps=0.1,
250, 100, intercept=TRUE, check.input=TRUE)
YRun the code above in your browser using DataLab