clusterComp: Estimate Microarray Cluster Stability

Description

This function estimates the stability of clustering solutions using microarray data. Currently only agglomerative hierarchical clustering is supported.

Usage

"clusterComp"(object, cl, seednum = NULL, B = 100,
sub.frac = 0.8, method = "ave", distmeth = "euclidean", adj.score = FALSE)
"clusterComp"(object, cl, seednum = NULL, B = 100,
sub.frac = 0.8, method = "ave", distmeth = "euclidean", adj.score = FALSE)

Arguments

object

Either a matrix or ExpressionSet

The number of clusters. This may be estimated using benhur

seednum

A value to pass to set.seed, which will allow for exact reproducibility at a later date.

The number of permutations.

sub.frac

The proportion of genes to use in each subsample. This value should be in the range of 0.75 - 0.85 for best results

method

The linkage method to pass to hclust. Valid values include "average", "centroid", "ward", "single", "mcquitty", or "median".

distmeth

The distance method to use. Valid values include "euclidean" and "pearson", where pearson implies 1-pearson correlation.

adj.score

Boolean. Should the stability scores be adjusted for cluster size? Defaults to FALSE.

Value

The output from this function is an object of class clusterComp. See the clusterComp-class man page for more information.

Details

This function estimates the stability of a clustering solution by repeatedly subsampling the data and comparing the cluster membership of the subsamples to the original clusters.

References

A. Ben-Hur, A. Elisseeff and I. Guyon. A stability based method for discovering structure in clustered data. Pacific Symposium on Biocomputing, 2002. Smolkin, M. and Ghosh, D. (2003). Cluster stability scores for microarray data in cancer studies . BMC Bioinformatics 4, 36 - 42.

Examples

Run this code

data(sample.ExpressionSet)
clusterComp(sample.ExpressionSet, 3)

Run the code above in your browser using DataLab