SigCheck that runs all available
checks against a classification signature.
sigCheck(expressionSet, classes, signature, annotation, validationSamples, classifierMethod = svmI, nIterations = 10, knownSignatures="cancer", plotResults=TRUE)ExpressionSet object containing the data to be checked,
ncluding an expression matrix, feature labels, and samples.
annotation parameter (default is row names in the expressionSet).
Alternatively, this can be a integer vector of feature indexes.
featureData field should be
used as the annotation. If missing, the row names of the expressionSet are
used as the feature names.
expressionSet should be used for validation. If present, a classifier
will be trained, using the specified signature and classification method, on
the non-validation samples, and it's performance evaluated by attempting to
classify the validations samples. If missing, a leave-one-out (LOO) validation
method will be used, where a separate classifier will be trained to classify
each sample using the reaming samples.
knownSignatures, or a list of previously
identified signatures to compare performance against. Each element in the
list should be a vector of feature labels. Default is to use the
"cancer" signatures from the included knownSignatures
data set, taken from Venet et. al.
TRUE, will call sigCheckPlot four times to plot the
results of all checks (laid out in a 2x2 plot matrix).
$checkClassifier is the result list returned by
sigCheckClassifier.$checkRandom is the result list returned by
sigCheckRandom.$checkKnown is the result list returned by
sigCheckKnown.$checkPermutedFeatures is the result list returned by
sigCheckPermuted
with toPermute="features".$checkPermutedCategories is the result list returned by
sigCheckPermuted
with toPermute="categories".
sigCheck calls sigCheckClassifier to establish the
baseline performance of the signature being checked.Next, it calls sigCheckRandom to check the performance of randomly
selected signatures.
This is followed by a call to sigCheckKnown to check the
performance of the signature against a database of signatures previously
identified to discriminate in other, generally unrelated domains.
Finally, two calls are made to sigCheckPermuted to check the
performance of randomly permuted data; the first call permutes the rows
(toPermute="features"), while the second call permutes the categories
(toPermute="categories").
sigCheckClassifier, sigCheckRandom,
sigCheckPermuted, sigCheckKnown,
MLearn
library(breastCancerNKI)
data(nki)
nki <- nki[,!is.na(nki$e.dmfs)]
data(knownSignatures)
results <- sigCheck(nki, classes="e.dmfs",
annotation="HUGO.gene.symbol",
signature=knownSignatures$cancer$VANTVEER,
validationSamples=275:319, nIterations=5)
Run the code above in your browser using DataLab