De novo assembly and genotyping of variants using colored de Bruijn graphs.

Detecting genetic variants that are highly divergent from a reference sequence remains a major challenge in genome sequencing. We introduce de novo assembly algorithms using colored de Bruijn graphs for detecting and genotyping simple and complex genetic variants in an individual or population. We provide an efficient software implementation, Cortex, the first de novo assembler capable of assembling multiple eukaryotic genomes simultaneously. Four applications of Cortex are presented. First, we detect and validate both simple and complex structural variations in a high-coverage human genome. Second, we identify more than 3 Mb of sequence absent from the human reference genome, in pooled low-coverage population sequence data from the 1000 Genomes Project. Third, we show how population information from ten chimpanzees enables accurate variant calls without a reference sequence. Last, we estimate classical human leukocyte antigen (HLA) genotypes at HLA-B, the most variable gene in the human genome.
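To make the central idea concrete, the following is a minimal sketch of a colored de Bruijn graph, not Cortex's actual implementation. Each k-mer is labeled with the set of samples ("colors") in which it occurs; a node with more than one successor marks a point where the samples may diverge, and k-mers carrying only one color lie on a sample-specific path. The function names, the toy sequences, and the choice of k = 4 are all illustrative assumptions, and the sketch ignores reverse complements, sequencing errors, and coverage.

```python
from collections import defaultdict


def kmers(seq, k):
    """Yield every k-length substring of seq, left to right."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def build_colored_graph(samples, k):
    """Map each k-mer to the set of sample names (colors) that contain it."""
    colors = defaultdict(set)
    for name, reads in samples.items():
        for read in reads:
            for kmer in kmers(read, k):
                colors[kmer].add(name)
    return colors


def successors(colors, kmer):
    """Return the k-mers in the graph that overlap kmer by its last k-1 bases."""
    return [kmer[1:] + base for base in "ACGT" if kmer[1:] + base in colors]


def branch_points(colors):
    """Return k-mers with more than one successor, where paths diverge."""
    return sorted(km for km in colors if len(successors(colors, km)) > 1)


# Toy input (hypothetical): two samples differing by a single base.
samples = {
    "sample_a": ["ACGTACCTGA"],
    "sample_b": ["ACGTAGCTGA"],
}

graph = build_colored_graph(samples, k=4)
branches = branch_points(graph)
private_a = sorted(km for km, c in graph.items() if c == {"sample_a"})

print("branch points:", branches)
print("k-mers only in sample_a:", private_a)
```

In this toy case the two paths split after a shared k-mer and rejoin at another shared k-mer, forming the kind of "bubble" that a variant caller would inspect; real data would also require error cleaning and handling of both strands.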

PubMed ID: 22231483


  • Iqbal Z
  • Caccamo M
  • Turner I
  • Flicek P
  • McVean G


Nature Genetics

Publication Data

February 27, 2012

Associated Grants

  • Agency: Wellcome Trust, Id: 085532
  • Agency: Wellcome Trust, Id: 086084
  • Agency: Wellcome Trust, Id: 090532
  • Agency: Wellcome Trust, Id: 090532/Z/09/Z
  • Agency: Wellcome Trust, Id: WT086084/Z/08/Z

MeSH Terms

  • Algorithms
  • Animals
  • Base Sequence
  • Chromosome Mapping
  • Genome, Human
  • Genotyping Techniques
  • HLA-B Antigens
  • Humans
  • Pan troglodytes
  • Sequence Analysis, DNA
  • Software