• Register
X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X

Leaving Community

Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.

No
Yes

Automating sequence-based detection and genotyping of SNPs from diploid samples.

The detection of sequence variation, for which DNA sequencing has emerged as the most sensitive and automated approach, forms the basis of all genetic analysis. Here we describe and illustrate an algorithm that accurately detects and genotypes SNPs from fluorescence-based sequence data. Because the algorithm focuses particularly on detecting SNPs through the identification of heterozygous individuals, it is especially well suited to the detection of SNPs in diploid samples obtained after DNA amplification. It is substantially more accurate than existing approaches and, notably, provides a useful quantitative measure of its confidence in each potential SNP detected and in each genotype called. Calls assigned the highest confidence are sufficiently reliable to remove the need for manual review in several contexts. For example, for sequence data from 47-90 individuals sequenced on both the forward and reverse strands, the highest-confidence calls from our algorithm detected 93% of all SNPs and 100% of high-frequency SNPs, with no false positive SNPs identified and 99.9% genotyping accuracy. This algorithm is implemented in a software package, PolyPhred version 5.0, which is freely available for academic use.

Pubmed ID: 16493422

Authors

  • Stephens M
  • Sloan JS
  • Robertson PD
  • Scheet P
  • Nickerson DA

Journal

Nature genetics

Publication Data

March 27, 2006

Associated Grants

  • Agency: NHGRI NIH HHS, Id: 1R01HG/LM-02585
  • Agency: NIEHS NIH HHS, Id: ES-15478
  • Agency: NHLBI NIH HHS, Id: HL-66682
  • Agency: NHGRI NIH HHS, Id: T32 HG00035-06

Mesh Terms

  • Algorithms
  • Automation
  • DNA
  • Diploidy
  • Genetic Variation
  • Genotype
  • Polymorphism, Single Nucleotide
  • Reproducibility of Results
  • Sensitivity and Specificity