simulateGEdata
returns simulated noisy gene expression values of specified size
and its underlying gene-gene correlation.simulateGEdata(n, m, k, size.alpha, corr.strength, g = NULL,
Sigma.eps = 0.1, nc, ne, intercept = TRUE, check.input = FALSE)
k
, corr.strength
)) giving the
correlation between $X$ and $W$ or NULL
for independence.Sigma.eps
$>0$.TRUE
all input is checked
(not advisable for large simulations).simulateGEdata
returns output of the class simulateGEdata
.
An object of class simulateGEdata
is a list
with the
following components:
Truth
Y
Noise
Sigma
Info
n
genes in
m
arrays. The expression values consist of true expression and noise:
$$Y=X\beta+W\alpha+\epsilon$$
The dimensions of the matrices $X$ and $\beta$ are used to control the size of
the correlation between the genes. It is possible to simualte three different classes
of genes:
nc
genes in the data,
whereas the strongly expressed genes are always the first ne
genes in the data.
The parameter intercept
controls whether the systematic noise has an
offset or not. Note that the intercept is one dimension of $W$.
It is possible to either simulate data where $W$ and $X$ are independent by
setting g
to NULL, or increasing correlation $bWX$ between
$W$ and $X$ by increasing g
.Y<-simulateGEdata(500, 500, 10, 2, 5, g=NULL, Sigma.eps=0.1,
250, 100, intercept=TRUE, check.input=TRUE)
Y
Y<-simulateGEdata(500, 500, 10, 2, 5, g=3, Sigma.eps=0.1,
250, 100, intercept=TRUE, check.input=TRUE)
Y
Run the code above in your browser using DataLab