sigCheckKnown(expressionSet, classes, signature, annotation, validationSamples, classifierMethod = svmI, classifierScore, knownSignatures="cancer")ExpressionSet object containing the data to be checked,
including an expression matrix, feature labels, and samples.
varLabels(expressionSet)). There should be only two
unique values in expressionSet$classes.
annotation parameter (default is row names in the expressionSet).
Alternatively, this can be a integer vector of feature indexes.
featureData field should be
used as the annotation. If missing, the row names of the expressionSet are
used as the feature names.
classifierScore element of the result list returned by
sigCheckClassifier.
If missing, sigCheckClassifier will be called to establish
baseline performance.
knownSignatures, or a list of previously
identified signatures to compare performance against. Each element in the
list should be a vector of feature labels. Default is to use the
"cancer" signatures from the included knownSignatures
data set, taken from Venet et. al.
$sigPerformance is the percentage of validationSamples correctly
classified (or, in the LOO case, the percentage of total samples correctly
classified by classifiers trained using the remaining samples.)$modePerformance is the percentage of validationSamples correctly
classified by a "mode" classifier (or, in the LOO case, the percentage of total
samples correctly classified by a "mode" classifier, which is equal the number
of samples with the more-frequent category.) The "mode" classifier always
predicts the category that appears most often in the training set.
If the training set is balanced between categories, one category will
always be predicted.$known is a character string indicating which gene signature set
was checked. Either one of the sets in knownSignatures, or the
string "user specified".$knownSigs is the number of signatures evaluated (equal to
length(knownSignatures), minus any signatures with zero features
matching the labels in expressionSet.)$rank is the performance rank of the primary signature classifier
on the original dataset amongst the performances of the known signatures
on the same dataset.$performanceKnown is a vector of performance scores (proportion
of the validation set correctly predicted) for each known signature
on the dataset.
sigCheckClassifier is called for each of the known signatures.
knownSignatures, sigCheck,
sigCheckClassifier, sigCheckRandom,
sigCheckPermuted, MLearn
library(breastCancerNKI)
data(nki)
nki <- nki[,!is.na(nki$e.dmfs)]
data(knownSignatures)
results <- sigCheckKnown(nki, classes="e.dmfs",
signature=knownSignatures$cancer$VANTVEER,
annotation="HUGO.gene.symbol",
validationSamples=275:319)
Run the code above in your browser using DataLab