Robust replication of genotype-phenotype associations across multiple diseases in an electronic medical record.

Large-scale DNA databanks linked to electronic medical record (EMR) systems have been proposed as an approach for rapidly generating large, diverse cohorts for discovery and replication of genotype-phenotype associations. However, the extent to which such resources are capable of delivering on this promise is unknown. We studied whether an EMR-linked DNA biorepository can be used to detect known genotype-phenotype associations for five diseases. Twenty-one SNPs previously implicated as common variants predisposing to atrial fibrillation, Crohn disease, multiple sclerosis, rheumatoid arthritis, or type 2 diabetes were successfully genotyped in 9483 samples accrued over 4 months into BioVU, the Vanderbilt University Medical Center DNA biobank. Previously reported odds ratios (OR(PR)) ranged from 1.14 to 2.36. For each phenotype, natural language processing techniques and billing-code queries were used to identify cases (n = 70-698) and controls (n = 808-3818) from deidentified health records. Each of the 21 tests of association yielded point estimates in the expected direction. Previous genotype-phenotype associations were replicated (p < 0.05) in 8/14 cases when the OR(PR) was > 1.25, and in 0/7 with lower OR(PR). Statistically significant associations were detected in all analyses that were adequately powered. In each of the five diseases studied, at least one previously reported association was replicated. These data demonstrate that phenotypes representing clinical diagnoses can be extracted from EMR systems, and they support the use of DNA resources coupled to EMR systems as tools for rapid generation of large data sets required for replication of associations found in research cohorts and for discovery in genome science.
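As an illustration of what a single test of association with an odds ratio and a p < 0.05 threshold can look like, here is a minimal sketch of an allelic odds ratio with a Wald confidence interval and a normal-approximation p-value. The abstract does not state which statistical model the authors used, so this is a generic textbook calculation and not their method. The function name `odds_ratio_test` and all allele counts in the usage lines are hypothetical and are not data from the study.

```python
import math


def odds_ratio_test(case_alt, case_ref, ctrl_alt, ctrl_ref):
    """Allelic odds ratio with a Wald 95% CI and a two-sided p-value.

    Arguments are allele counts from a 2x2 table (hypothetical layout):
    risk-allele and reference-allele counts in cases and in controls.
    All four counts must be positive.
    """
    odds_ratio = (case_alt * ctrl_ref) / (case_ref * ctrl_alt)
    log_or = math.log(odds_ratio)
    # Standard error of log(OR) from the four cell counts (Woolf's method).
    se = math.sqrt(1 / case_alt + 1 / case_ref + 1 / ctrl_alt + 1 / ctrl_ref)
    z = log_or / se
    # Two-sided p-value under a standard normal approximation.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    ci = (math.exp(log_or - 1.96 * se), math.exp(log_or + 1.96 * se))
    return odds_ratio, ci, p_value


# Hypothetical counts, chosen only to exercise the function.
or_hat, ci95, p = odds_ratio_test(300, 700, 2000, 6000)
print(f"OR = {or_hat:.2f}, 95% CI = ({ci95[0]:.2f}, {ci95[1]:.2f}), p = {p:.4g}")
```

With a fixed effect size, the standard error shrinks as the cell counts grow, which is why small odds ratios need larger case and control groups to reach significance.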

PubMed ID: 20362271

Authors

  • Ritchie MD
  • Denny JC
  • Crawford DC
  • Ramirez AH
  • Weiner JB
  • Pulley JM
  • Basford MA
  • Brown-Gentry K
  • Balser JR
  • Masys DR
  • Haines JL
  • Roden DM

Journal

American Journal of Human Genetics

Publication Date

April 9, 2010

Associated Grants

  • Agency: NCRR NIH HHS, Id: 1UL1 RR024975-01
  • Agency: NHGRI NIH HHS, Id: U01 HG004603
  • Agency: NHGRI NIH HHS, Id: U01 HG04603
  • Agency: NCRR NIH HHS, Id: UL1 RR024975

Mesh Terms

  • Arthritis, Rheumatoid
  • Atrial Fibrillation
  • Case-Control Studies
  • Crohn Disease
  • DNA
  • Diabetes Mellitus, Type 2
  • Electronic Health Records
  • Genetic Association Studies
  • Genome, Human
  • Genome-Wide Association Study
  • Genotype
  • Humans
  • Multiple Sclerosis
  • Phenotype
  • Polymorphism, Single Nucleotide