A cluster analysis is the most commonly performed procedure (often regarded as a first step) on a set of gene expression profiles. In most cases, a post hoc analysis is done to see if the genes in the same clusters can be functionally correlated. While past successes of such analyses have often been reported in a number of microarray studies (most of which used the standard hierarchical clustering, UPGMA, with one minus the Pearson's correlation coefficient as a measure of dissimilarity), often times such groupings could be misleading. More importantly, a systematic evaluation of the entire set of clusters produced by such unsupervised procedures is necessary since they also contain genes that are seemingly unrelated or may have more than one common function. Here we quantify the performance of a given unsupervised clustering algorithm applied to a given microarray study in terms of its ability to produce biologically meaningful clusters using a reference set of functional classes. Such a reference set may come from prior biological knowledge specific to a microarray study or may be formed using the growing databases of gene ontologies (GO) for the annotated genes of the relevant species.
Pubmed ID: 16945146 RIS Download
Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.
Web tool to search, sort, analyze, visualize and download data of interest. Along with providing details of the ontologies, gene products and annotations, features a BLAST search, Term Enrichment and GO Slimmer tools, the GO Online SQL Environment and a user help guide.Used at the Gene Ontology (GO) website to access the data provided by the GO Consortium. Developed and maintained by the GO Consortium.
View all literature mentionsSoftware application that provides researchers with the tools necessary for various types of statistical genetic analysis of human family data. (entry from Genetic Analysis Software)
View all literature mentions