SigCheck
that runs all available
checks against a classification signature.
sigCheck(expressionSet, classes, signature, annotation, validationSamples, classifierMethod = svmI, nIterations = 10, knownSignatures="cancer", plotResults=TRUE)
ExpressionSet
object containing the data to be checked,
ncluding an expression matrix, feature labels, and samples.
annotation
parameter (default is row names in the expressionSet).
Alternatively, this can be a integer vector of feature indexes.
featureData
field should be
used as the annotation. If missing, the row names of the expressionSet are
used as the feature names.
expressionSet
should be used for validation. If present, a classifier
will be trained, using the specified signature and classification method, on
the non-validation samples, and it's performance evaluated by attempting to
classify the validations samples. If missing, a leave-one-out (LOO) validation
method will be used, where a separate classifier will be trained to classify
each sample using the reaming samples.
knownSignatures
, or a list of previously
identified signatures to compare performance against. Each element in the
list should be a vector of feature labels. Default is to use the
"cancer"
signatures from the included knownSignatures
data set, taken from Venet et. al.
TRUE
, will call sigCheckPlot
four times to plot the
results of all checks (laid out in a 2x2 plot matrix).
$checkClassifier
is the result list returned by
sigCheckClassifier
.$checkRandom
is the result list returned by
sigCheckRandom
.$checkKnown
is the result list returned by
sigCheckKnown
.$checkPermutedFeatures
is the result list returned by
sigCheckPermuted
with toPermute="features"
.$checkPermutedCategories
is the result list returned by
sigCheckPermuted
with toPermute="categories"
.
sigCheck
calls sigCheckClassifier
to establish the
baseline performance of the signature being checked.Next, it calls sigCheckRandom
to check the performance of randomly
selected signatures.
This is followed by a call to sigCheckKnown
to check the
performance of the signature against a database of signatures previously
identified to discriminate in other, generally unrelated domains.
Finally, two calls are made to sigCheckPermuted
to check the
performance of randomly permuted data; the first call permutes the rows
(toPermute="features"
), while the second call permutes the categories
(toPermute="categories"
).
sigCheckClassifier
, sigCheckRandom
,
sigCheckPermuted
, sigCheckKnown
,
MLearn
library(breastCancerNKI)
data(nki)
nki <- nki[,!is.na(nki$e.dmfs)]
data(knownSignatures)
results <- sigCheck(nki, classes="e.dmfs",
annotation="HUGO.gene.symbol",
signature=knownSignatures$cancer$VANTVEER,
validationSamples=275:319, nIterations=5)
Run the code above in your browser using DataLab