• Register
X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X

Leaving Community

Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.

No
Yes

MaCH-admix: genotype imputation for admixed populations.

Imputation in admixed populations is an important problem but challenging due to the complex linkage disequilibrium (LD) pattern. The emergence of large reference panels such as that from the 1,000 Genomes Project enables more accurate imputation in general, and in particular for admixed populations and for uncommon variants. To efficiently benefit from these large reference panels, one key issue to consider in modern genotype imputation framework is the selection of effective reference panels. In this work, we consider a number of methods for effective reference panel construction inside a hidden Markov model and specific to each target individual. These methods fall into two categories: identity-by-state (IBS) based and ancestry-weighted approach. We evaluated the performance on individuals from recently admixed populations. Our target samples include 8,421 African Americans and 3,587 Hispanic Americans from the Women' Health Initiative, which allow assessment of imputation quality for uncommon variants. Our experiments include both large and small reference panels; large, medium, and small target samples; and in genome regions of varying levels of LD. We also include BEAGLE and IMPUTE2 for comparison. Experiment results with large reference panel suggest that our novel piecewise IBS method yields consistently higher imputation quality than other methods/software. The advantage is particularly noteworthy among uncommon variants where we observe up to 5.1% information gain with the difference being highly significant (Wilcoxon signed rank test P-value < 0.0001). Our work is the first that considers various sensible approaches for imputation in admixed populations and presents a comprehensive comparison.

Pubmed ID: 23074066

Authors

  • Liu EY
  • Li M
  • Wang W
  • Li Y

Journal

Genetic epidemiology

Publication Data

January 17, 2013

Associated Grants

  • Agency: NIMH NIH HHS, Id: 1-RC2-MH089951-01
  • Agency: NCI NIH HHS, Id: P30 CA016042
  • Agency: NHGRI NIH HHS, Id: R01 HG005854
  • Agency: NHGRI NIH HHS, Id: R01 HG006292
  • Agency: NHGRI NIH HHS, Id: R01 HG006703
  • Agency: NHGRI NIH HHS, Id: R01HG004517
  • Agency: NHGRI NIH HHS, Id: R01HG005854
  • Agency: NHGRI NIH HHS, Id: R01HG006292
  • Agency: NHGRI NIH HHS, Id: R01HG006703
  • Agency: NCI NIH HHS, Id: U01 CA134240

Mesh Terms

  • African Americans
  • Female
  • Genetics, Population
  • Genome, Human
  • Genotype
  • HapMap Project
  • Haplotypes
  • Hispanic Americans
  • Humans
  • Linkage Disequilibrium
  • Markov Chains
  • Models, Genetic
  • Pedigree
  • Software