Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data.

Genome-wide expression profiling is a powerful tool for implicating novel gene ensembles in cellular mechanisms of health and disease. The most popular platform for genome-wide expression profiling is the Affymetrix GeneChip. However, its selection of probes relied on earlier genome and transcriptome annotation which is significantly different from current knowledge. The resultant informatics problems have a profound impact on analysis and interpretation the data. Here, we address these critical issues and offer a solution. We identified several classes of problems at the individual probe level in the existing annotation, under the assumption that current genome and transcriptome databases are more accurate than those used for GeneChip design. We then reorganized probes on more than a dozen popular GeneChips into gene-, transcript- and exon-specific probe sets in light of up-to-date genome, cDNA/EST clustering and single nucleotide polymorphism information. Comparing analysis results between the original and the redefined probe sets reveals approximately 30-50% discrepancy in the genes previously identified as differentially expressed, regardless of analysis method. Our results demonstrate that the original Affymetrix probe set definitions are inaccurate, and many conclusions derived from past GeneChip analyses may be significantly flawed. It will be beneficial to re-analyze existing GeneChip data with updated probe set definitions.

Pubmed ID: 16284200


  • Dai M
  • Wang P
  • Boyd AD
  • Kostov G
  • Athey B
  • Jones EG
  • Bunney WE
  • Myers RM
  • Speed TP
  • Akil H
  • Watson SJ
  • Meng F


Nucleic acids research

Publication Data

November 14, 2005

Associated Grants


Mesh Terms

  • Alleles
  • Animals
  • Cluster Analysis
  • Databases, Nucleic Acid
  • Exons
  • Gene Expression Profiling
  • Humans
  • Mice
  • Oligonucleotide Array Sequence Analysis
  • Oligonucleotide Probes
  • Rats
  • Reproducibility of Results
  • Transcription, Genetic