sigCheckKnown(expressionSet, classes, signature, annotation, validationSamples, classifierMethod = svmI, classifierScore, knownSignatures="cancer")
ExpressionSet
object containing the data to be checked,
including an expression matrix, feature labels, and samples.
varLabels(expressionSet)
). There should be only two
unique values in expressionSet$classes
.
annotation
parameter (default is row names in the expressionSet).
Alternatively, this can be a integer vector of feature indexes.
featureData
field should be
used as the annotation. If missing, the row names of the expressionSet are
used as the feature names.
classifierScore
element of the result list returned by
sigCheckClassifier
.
If missing, sigCheckClassifier
will be called to establish
baseline performance.
knownSignatures
, or a list of previously
identified signatures to compare performance against. Each element in the
list should be a vector of feature labels. Default is to use the
"cancer"
signatures from the included knownSignatures
data set, taken from Venet et. al.
$sigPerformance
is the percentage of validationSamples correctly
classified (or, in the LOO case, the percentage of total samples correctly
classified by classifiers trained using the remaining samples.)$modePerformance
is the percentage of validationSamples correctly
classified by a "mode" classifier (or, in the LOO case, the percentage of total
samples correctly classified by a "mode" classifier, which is equal the number
of samples with the more-frequent category.) The "mode" classifier always
predicts the category that appears most often in the training set.
If the training set is balanced between categories, one category will
always be predicted.$known
is a character string indicating which gene signature set
was checked. Either one of the sets in knownSignatures
, or the
string "user specified"
.$knownSigs
is the number of signatures evaluated (equal to
length(knownSignatures)
, minus any signatures with zero features
matching the labels in expressionSet
.)$rank
is the performance rank of the primary signature classifier
on the original dataset amongst the performances of the known signatures
on the same dataset.$performanceKnown
is a vector of performance scores (proportion
of the validation set correctly predicted) for each known signature
on the dataset.
sigCheckClassifier
is called for each of the known signatures.
knownSignatures
, sigCheck
,
sigCheckClassifier
, sigCheckRandom
,
sigCheckPermuted
, MLearn
library(breastCancerNKI)
data(nki)
nki <- nki[,!is.na(nki$e.dmfs)]
data(knownSignatures)
results <- sigCheckKnown(nki, classes="e.dmfs",
signature=knownSignatures$cancer$VANTVEER,
annotation="HUGO.gene.symbol",
validationSamples=275:319)
Run the code above in your browser using DataLab