2024MAY02: Our hosting provider has resolved some DB connectivity issues. We may experience some more outages as the issue is resolved. We apologize for the inconvenience. Dismiss and don't show again

Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

This service exclusively searches for literature that cites resources. Please be aware that the total number of searchable documents is limited to those containing RRIDs and does not include all open-access literature.

Search

Type in a keyword to search

On page 1 showing 1 ~ 20 papers out of 98 papers

Common dysregulation network in the human prefrontal cortex underlies two neurodegenerative diseases.

  • Manikandan Narayanan‎ et al.
  • Molecular systems biology‎
  • 2014‎

Using expression profiles from postmortem prefrontal cortex samples of 624 dementia patients and non-demented controls, we investigated global disruptions in the co-regulation of genes in two neurodegenerative diseases, late-onset Alzheimer's disease (AD) and Huntington's disease (HD). We identified networks of differentially co-expressed (DC) gene pairs that either gained or lost correlation in disease cases relative to the control group, with the former dominant for both AD and HD and both patterns replicating in independent human cohorts of AD and aging. When aligning networks of DC patterns and physical interactions, we identified a 242-gene subnetwork enriched for independent AD/HD signatures. This subnetwork revealed a surprising dichotomy of gained/lost correlations among two inter-connected processes, chromatin organization and neural differentiation, and included DNA methyltransferases, DNMT1 and DNMT3A, of which we predicted the former but not latter as a key regulator. To validate the inter-connection of these two processes and our key regulator prediction, we generated two brain-specific knockout (KO) mice and show that Dnmt1 KO signature significantly overlaps with the subnetwork (P = 3.1 × 10(-12)), while Dnmt3a KO signature does not (P = 0.017).


Stochastic specification of primordial germ cells from mesoderm precursors in axolotl embryos.

  • Jodie Chatfield‎ et al.
  • Development (Cambridge, England)‎
  • 2014‎

A common feature of development in most vertebrate models is the early segregation of the germ line from the soma. For example, in Xenopus and zebrafish embryos primordial germ cells (PGCs) are specified by germ plasm that is inherited from the egg; in mice, Blimp1 expression in the epiblast mediates the commitment of cells to the germ line. How these disparate mechanisms of PGC specification evolved is unknown. Here, in order to identify the ancestral mechanism of PGC specification in vertebrates, we studied PGC specification in embryos from the axolotl (Mexican salamander), a model for the tetrapod ancestor. In the axolotl, PGCs develop within mesoderm, and classic studies have reported their induction from primitive ectoderm (animal cap). We used an axolotl animal cap system to demonstrate that signalling through FGF and BMP4 induces PGCs. The role of FGF was then confirmed in vivo. We also showed PGC induction by Brachyury, in the presence of BMP4. These conditions induced pluripotent mesodermal precursors that give rise to a variety of somatic cell types, in addition to PGCs. Irreversible restriction of the germ line did not occur until the mid-tailbud stage, days after the somatic germ layers are established. Before this, germline potential was maintained by MAP kinase signalling. We propose that this stochastic mechanism of PGC specification, from mesodermal precursors, is conserved in vertebrates.


Genome-wide Trans-ethnic Meta-analysis Identifies Seven Genetic Loci Influencing Erythrocyte Traits and a Role for RBPMS in Erythropoiesis.

  • Frank J A van Rooij‎ et al.
  • American journal of human genetics‎
  • 2017‎

Genome-wide association studies (GWASs) have identified loci for erythrocyte traits in primarily European ancestry populations. We conducted GWAS meta-analyses of six erythrocyte traits in 71,638 individuals from European, East Asian, and African ancestries using a Bayesian approach to account for heterogeneity in allelic effects and variation in the structure of linkage disequilibrium between ethnicities. We identified seven loci for erythrocyte traits including a locus (RBPMS/GTF2E2) associated with mean corpuscular hemoglobin and mean corpuscular volume. Statistical fine-mapping at this locus pointed to RBPMS at this locus and excluded nearby GTF2E2. Using zebrafish morpholino to evaluate loss of function, we observed a strong in vivo erythropoietic effect for RBPMS but not for GTF2E2, supporting the statistical fine-mapping at this locus and demonstrating that RBPMS is a regulator of erythropoiesis. Our findings show the utility of trans-ethnic GWASs for discovery and characterization of genetic loci influencing hematologic traits.


Genome-wide association analysis of blood-pressure traits in African-ancestry individuals reveals common associated genes in African and non-African populations.

  • Nora Franceschini‎ et al.
  • American journal of human genetics‎
  • 2013‎

High blood pressure (BP) is more prevalent and contributes to more severe manifestations of cardiovascular disease (CVD) in African Americans than in any other United States ethnic group. Several small African-ancestry (AA) BP genome-wide association studies (GWASs) have been published, but their findings have failed to replicate to date. We report on a large AA BP GWAS meta-analysis that includes 29,378 individuals from 19 discovery cohorts and subsequent replication in additional samples of AA (n = 10,386), European ancestry (EA) (n = 69,395), and East Asian ancestry (n = 19,601). Five loci (EVX1-HOXA, ULK4, RSPO3, PLEKHG1, and SOX6) reached genome-wide significance (p < 1.0 × 10(-8)) for either systolic or diastolic BP in a transethnic meta-analysis after correction for multiple testing. Three of these BP loci (EVX1-HOXA, RSPO3, and PLEKHG1) lack previous associations with BP. We also identified one independent signal in a known BP locus (SOX6) and provide evidence for fine mapping in four additional validated BP loci. We also demonstrate that validated EA BP GWAS loci, considered jointly, show significant effects in AA samples. Consequently, these findings suggest that BP loci might have universal effects across studied populations, demonstrating that multiethnic samples are an essential component in identifying, fine mapping, and understanding their trait variability.


Large-scale genomic analyses link reproductive aging to hypothalamic signaling, breast cancer susceptibility and BRCA1-mediated DNA repair.

  • Felix R Day‎ et al.
  • Nature genetics‎
  • 2015‎

Menopause timing has a substantial impact on infertility and risk of disease, including breast cancer, but the underlying mechanisms are poorly understood. We report a dual strategy in ∼70,000 women to identify common and low-frequency protein-coding variation associated with age at natural menopause (ANM). We identified 44 regions with common variants, including two regions harboring additional rare missense alleles of large effect. We found enrichment of signals in or near genes involved in delayed puberty, highlighting the first molecular links between the onset and end of reproductive lifespan. Pathway analyses identified major association with DNA damage response (DDR) genes, including the first common coding variant in BRCA1 associated with any complex trait. Mendelian randomization analyses supported a causal effect of later ANM on breast cancer risk (∼6% increase in risk per year; P = 3 × 10(-14)), likely mediated by prolonged sex hormone exposure rather than DDR mechanisms.


Whole-Exome Sequencing Identifies Loci Associated with Blood Cell Traits and Reveals a Role for Alternative GFI1B Splice Variants in Human Hematopoiesis.

  • Linda M Polfus‎ et al.
  • American journal of human genetics‎
  • 2016‎

Circulating blood cell counts and indices are important indicators of hematopoietic function and a number of clinical parameters, such as blood oxygen-carrying capacity, inflammation, and hemostasis. By performing whole-exome sequence association analyses of hematologic quantitative traits in 15,459 community-dwelling individuals, followed by in silico replication in up to 52,024 independent samples, we identified two previously undescribed coding variants associated with lower platelet count: a common missense variant in CPS1 (rs1047891, MAF = 0.33, discovery + replication p = 6.38 × 10(-10)) and a rare synonymous variant in GFI1B (rs150813342, MAF = 0.009, discovery + replication p = 1.79 × 10(-27)). By performing CRISPR/Cas9 genome editing in hematopoietic cell lines and follow-up targeted knockdown experiments in primary human hematopoietic stem and progenitor cells, we demonstrate an alternative splicing mechanism by which the GFI1B rs150813342 variant suppresses formation of a GFI1B isoform that preferentially promotes megakaryocyte differentiation and platelet production. These results demonstrate how unbiased studies of natural variation in blood cell traits can provide insight into the regulation of human hematopoiesis.


Exome Genotyping Identifies Pleiotropic Variants Associated with Red Blood Cell Traits.

  • Nathalie Chami‎ et al.
  • American journal of human genetics‎
  • 2016‎

Red blood cell (RBC) traits are important heritable clinical biomarkers and modifiers of disease severity. To identify coding genetic variants associated with these traits, we conducted meta-analyses of seven RBC phenotypes in 130,273 multi-ethnic individuals from studies genotyped on an exome array. After conditional analyses and replication in 27,480 independent individuals, we identified 16 new RBC variants. We found low-frequency missense variants in MAP1A (rs55707100, minor allele frequency [MAF] = 3.3%, p = 2 × 10(-10) for hemoglobin [HGB]) and HNF4A (rs1800961, MAF = 2.4%, p < 3 × 10(-8) for hematocrit [HCT] and HGB). In African Americans, we identified a nonsense variant in CD36 associated with higher RBC distribution width (rs3211938, MAF = 8.7%, p = 7 × 10(-11)) and showed that it is associated with lower CD36 expression and strong allelic imbalance in ex vivo differentiated human erythroblasts. We also identified a rare missense variant in ALAS2 (rs201062903, MAF = 0.2%) associated with lower mean corpuscular volume and mean corpuscular hemoglobin (p < 8 × 10(-9)). Mendelian mutations in ALAS2 are a cause of sideroblastic anemia and erythropoietic protoporphyria. Gene-based testing highlighted three rare missense variants in PKLR, a gene mutated in Mendelian non-spherocytic hemolytic anemia, associated with HGB and HCT (SKAT p < 8 × 10(-7)). These rare, low-frequency, and common RBC variants showed pleiotropy, being also associated with platelet, white blood cell, and lipid traits. Our association results and functional annotation suggest the involvement of new genes in human erythropoiesis. We also confirm that rare and low-frequency variants play a role in the architecture of complex human traits, although their phenotypic effect is generally smaller than originally anticipated.


GWAS analysis of handgrip and lower body strength in older adults in the CHARGE consortium.

  • Amy M Matteini‎ et al.
  • Aging cell‎
  • 2016‎

Decline in muscle strength with aging is an important predictor of health trajectory in the elderly. Several factors, including genetics, are proposed contributors to variability in muscle strength. To identify genetic contributors to muscle strength, a meta-analysis of genomewide association studies of handgrip was conducted. Grip strength was measured using a handheld dynamometer in 27 581 individuals of European descent over 65 years of age from 14 cohort studies. Genomewide association analysis was conducted on ~2.7 million imputed and genotyped variants (SNPs). Replication of the most significant findings was conducted using data from 6393 individuals from three cohorts. GWAS of lower body strength was also characterized in a subset of cohorts. Two genomewide significant (P-value< 5 × 10(-8) ) and 39 suggestive (P-value< 5 × 10(-5) ) associations were observed from meta-analysis of the discovery cohorts. After meta-analysis with replication cohorts, genomewide significant association was observed for rs752045 on chromosome 8 (β = 0.47, SE = 0.08, P-value = 5.20 × 10(-10) ). This SNP is mapped to an intergenic region and is located within an accessible chromatin region (DNase hypersensitivity site) in skeletal muscle myotubes differentiated from the human skeletal muscle myoblasts cell line. This locus alters a binding motif of the CCAAT/enhancer-binding protein-β (CEBPB) that is implicated in muscle repair mechanisms. GWAS of lower body strength did not yield significant results. A common genetic variant in a chromosomal region that regulates myotube differentiation and muscle repair may contribute to variability in grip strength in the elderly. Further studies are needed to uncover the mechanisms that link this genetic variant with muscle strength.


Dynamic Role of trans Regulation of Gene Expression in Relation to Complex Traits.

  • Chen Yao‎ et al.
  • American journal of human genetics‎
  • 2017‎

Identifying causal genetic variants and understanding their mechanisms of effect on traits remains a challenge in genome-wide association studies (GWASs). In particular, how genetic variants (i.e., trans-eQTLs) affect expression of remote genes (i.e., trans-eGenes) remains unknown. We hypothesized that some trans-eQTLs regulate expression of distant genes by altering the expression of nearby genes (cis-eGenes). Using published GWAS datasets with 39,165 single-nucleotide polymorphisms (SNPs) associated with 1,960 traits, we explored whole blood gene expression associations of trait-associated SNPs in 5,257 individuals from the Framingham Heart Study. We identified 2,350 trans-eQTLs (at p < 10-7); more than 80% of them were found to have cis-associated eGenes. Mediation testing suggested that for 35% of trans-eQTL-trans-eGene pairs in different chromosomes and 90% pairs in the same chromosome, the disease-associated SNP may alter expression of the trans-eGene via cis-eGene expression. In addition, we identified 13 trans-eQTL hotspots, affecting from ten to hundreds of genes, suggesting the existence of master genetic regulators. Using causal inference testing, we searched causal variants across eight cardiometabolic traits (BMI, systolic and diastolic blood pressure, LDL cholesterol, HDL cholesterol, total cholesterol, triglycerides, and fasting blood glucose) and identified several cis-eGenes (ALDH2 for systolic and diastolic blood pressure, MCM6 and DARS for total cholesterol, and TRIB1 for triglycerides) that were causal mediators for the corresponding traits, as well as examples of trans-mediators (TAGAP for LDL cholesterol). The finding of extensive evidence of genome-wide mediation effects suggests a critical role of cryptic gene regulation underlying many disease traits.


HDAC9 is implicated in atherosclerotic aortic calcification and affects vascular smooth muscle cell phenotype.

  • Rajeev Malhotra‎ et al.
  • Nature genetics‎
  • 2019‎

Aortic calcification is an important independent predictor of future cardiovascular events. We performed a genome-wide association meta-analysis to determine SNPs associated with the extent of abdominal aortic calcification (n = 9,417) or descending thoracic aortic calcification (n = 8,422). Two genetic loci, HDAC9 and RAP1GAP, were associated with abdominal aortic calcification at a genome-wide level (P < 5.0 × 10-8). No SNPs were associated with thoracic aortic calcification at the genome-wide threshold. Increased expression of HDAC9 in human aortic smooth muscle cells promoted calcification and reduced contractility, while inhibition of HDAC9 in human aortic smooth muscle cells inhibited calcification and enhanced cell contractility. In matrix Gla protein-deficient mice, a model of human vascular calcification, mice lacking HDAC9 had a 40% reduction in aortic calcification and improved survival. This translational genomic study identifies the first genetic risk locus associated with calcification of the abdominal aorta and describes a previously unknown role for HDAC9 in the development of vascular calcification.


Trans-ethnic and Ancestry-Specific Blood-Cell Genetics in 746,667 Individuals from 5 Global Populations.

  • Ming-Huei Chen‎ et al.
  • Cell‎
  • 2020‎

Most loci identified by GWASs have been found in populations of European ancestry (EUR). In trans-ethnic meta-analyses for 15 hematological traits in 746,667 participants, including 184,535 non-EUR individuals, we identified 5,552 trait-variant associations at p < 5 × 10-9, including 71 novel associations not found in EUR populations. We also identified 28 additional novel variants in ancestry-specific, non-EUR meta-analyses, including an IL7 missense variant in South Asians associated with lymphocyte count in vivo and IL-7 secretion levels in vitro. Fine-mapping prioritized variants annotated as functional and generated 95% credible sets that were 30% smaller when using the trans-ethnic as opposed to the EUR-only results. We explored the clinical significance and predictive value of trans-ethnic variants in multiple populations and compared genetic architecture and the effect of natural selection on these blood phenotypes between populations. Altogether, our results for hematological traits highlight the value of a more global representation of populations in genetic studies.


The Polygenic and Monogenic Basis of Blood Traits and Diseases.

  • Dragana Vuckovic‎ et al.
  • Cell‎
  • 2020‎

Blood cells play essential roles in human health, underpinning physiological processes such as immunity, oxygen transport, and clotting, which when perturbed cause a significant global health burden. Here we integrate data from UK Biobank and a large-scale international collaborative effort, including data for 563,085 European ancestry participants, and discover 5,106 new genetic variants independently associated with 29 blood cell phenotypes covering a range of variation impacting hematopoiesis. We holistically characterize the genetic architecture of hematopoiesis, assess the relevance of the omnigenic model to blood cell phenotypes, delineate relevant hematopoietic cell states influenced by regulatory genetic variants and gene networks, identify novel splice-altering variants mediating the associations, and assess the polygenic prediction potential for blood traits and clinical disorders at the interface of complex and Mendelian genetics. These results show the power of large-scale blood cell trait GWAS to interrogate clinically meaningful variants across a wide allelic spectrum of human variation.


Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.

  • Daniel Taliun‎ et al.
  • Nature‎
  • 2021‎

The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.


Use of >100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations.

  • Madeline H Kowalski‎ et al.
  • PLoS genetics‎
  • 2019‎

Most genome-wide association and fine-mapping studies to date have been conducted in individuals of European descent, and genetic studies of populations of Hispanic/Latino and African ancestry are limited. In addition, these populations have more complex linkage disequilibrium structure. In order to better define the genetic architecture of these understudied populations, we leveraged >100,000 phased sequences available from deep-coverage whole genome sequencing through the multi-ethnic NHLBI Trans-Omics for Precision Medicine (TOPMed) program to impute genotypes into admixed African and Hispanic/Latino samples with genome-wide genotyping array data. We demonstrated that using TOPMed sequencing data as the imputation reference panel improves genotype imputation quality in these populations, which subsequently enhanced gene-mapping power for complex traits. For rare variants with minor allele frequency (MAF) < 0.5%, we observed a 2.3- to 6.1-fold increase in the number of well-imputed variants, with 11-34% improvement in average imputation quality, compared to the state-of-the-art 1000 Genomes Project Phase 3 and Haplotype Reference Consortium reference panels. Impressively, even for extremely rare variants with minor allele count <10 (including singletons) in the imputation target samples, average information content rescued was >86%. Subsequent association analyses of TOPMed reference panel-imputed genotype data with hematological traits (hemoglobin (HGB), hematocrit (HCT), and white blood cell count (WBC)) in ~21,600 African-ancestry and ~21,700 Hispanic/Latino individuals identified associations with two rare variants in the HBB gene (rs33930165 with higher WBC [p = 8.8x10-15] in African populations, rs11549407 with lower HGB [p = 1.5x10-12] and HCT [p = 8.8x10-10] in Hispanics/Latinos). By comparison, neither variant would have been genome-wide significant if either 1000 Genomes Project Phase 3 or Haplotype Reference Consortium reference panels had been used for imputation. Our findings highlight the utility of the TOPMed imputation reference panel for identification of novel rare variant associations not previously detected in similarly sized genome-wide studies of under-represented African and Hispanic/Latino populations.


Impact of Rare and Common Genetic Variants on Diabetes Diagnosis by Hemoglobin A1c in Multi-Ancestry Cohorts: The Trans-Omics for Precision Medicine Program.

  • Chloé Sarnowski‎ et al.
  • American journal of human genetics‎
  • 2019‎

Hemoglobin A1c (HbA1c) is widely used to diagnose diabetes and assess glycemic control in individuals with diabetes. However, nonglycemic determinants, including genetic variation, may influence how accurately HbA1c reflects underlying glycemia. Analyzing the NHLBI Trans-Omics for Precision Medicine (TOPMed) sequence data in 10,338 individuals from five studies and four ancestries (6,158 Europeans, 3,123 African-Americans, 650 Hispanics, and 407 East Asians), we confirmed five regions associated with HbA1c (GCK in Europeans and African-Americans, HK1 in Europeans and Hispanics, FN3K and/or FN3KRP in Europeans, and G6PD in African-Americans and Hispanics) and we identified an African-ancestry-specific low-frequency variant (rs1039215 in HBG2 and HBE1, minor allele frequency (MAF) = 0.03). The most associated G6PD variant (rs1050828-T, p.Val98Met, MAF = 12% in African-Americans, MAF = 2% in Hispanics) lowered HbA1c (-0.88% in hemizygous males, -0.34% in heterozygous females) and explained 23% of HbA1c variance in African-Americans and 4% in Hispanics. Additionally, we identified a rare distinct G6PD coding variant (rs76723693, p.Leu353Pro, MAF = 0.5%; -0.98% in hemizygous males, -0.46% in heterozygous females) and detected significant association with HbA1c when aggregating rare missense variants in G6PD. We observed similar magnitude and direction of effects for rs1039215 (HBG2) and rs76723693 (G6PD) in the two largest TOPMed African American cohorts, and we replicated the rs76723693 association in the UK Biobank African-ancestry participants. These variants in G6PD and HBG2 were monomorphic in the European and Asian samples. African or Hispanic ancestry individuals carrying G6PD variants may be underdiagnosed for diabetes when screened with HbA1c. Thus, assessment of these variants should be considered for incorporation into precision medicine approaches for diabetes diagnosis.


Clonal hematopoiesis associated with epigenetic aging and clinical outcomes.

  • Daniel Nachun‎ et al.
  • Aging cell‎
  • 2021‎

Clonal hematopoiesis of indeterminate potential (CHIP) is a common precursor state for blood cancers that most frequently occurs due to mutations in the DNA-methylation modifying enzymes DNMT3A or TET2. We used DNA-methylation array and whole-genome sequencing data from four cohorts together comprising 5522 persons to study the association between CHIP, epigenetic clocks, and health outcomes. CHIP was strongly associated with epigenetic age acceleration, defined as the residual after regressing epigenetic clock age on chronological age, in several clocks, ranging from 1.31 years (GrimAge, p < 8.6 × 10-7 ) to 3.08 years (EEAA, p < 3.7 × 10-18 ). Mutations in most CHIP genes except DNA-damage response genes were associated with increases in several measures of age acceleration. CHIP carriers with mutations in multiple genes had the largest increases in age acceleration and decrease in estimated telomere length. Finally, we found that ~40% of CHIP carriers had acceleration >0 in both Hannum and GrimAge (referred to as AgeAccelHG+). This group was at high risk of all-cause mortality (hazard ratio 2.90, p < 4.1 × 10-8 ) and coronary heart disease (CHD) (hazard ratio 3.24, p < 9.3 × 10-6 ) compared to those who were CHIP-/AgeAccelHG-. In contrast, the other ~60% of CHIP carriers who were AgeAccelHG- were not at increased risk of these outcomes. In summary, CHIP is strongly linked to age acceleration in multiple clocks, and the combination of CHIP and epigenetic aging may be used to identify a population at high risk for adverse outcomes and who may be a target for clinical interventions.


A System for Phenotype Harmonization in the National Heart, Lung, and Blood Institute Trans-Omics for Precision Medicine (TOPMed) Program.

  • Adrienne M Stilp‎ et al.
  • American journal of epidemiology‎
  • 2021‎

Genotype-phenotype association studies often combine phenotype data from multiple studies to increase statistical power. Harmonization of the data usually requires substantial effort due to heterogeneity in phenotype definitions, study design, data collection procedures, and data-set organization. Here we describe a centralized system for phenotype harmonization that includes input from phenotype domain and study experts, quality control, documentation, reproducible results, and data-sharing mechanisms. This system was developed for the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program, which is generating genomic and other -omics data for more than 80 studies with extensive phenotype data. To date, 63 phenotypes have been harmonized across thousands of participants (recruited in 1948-2012) from up to 17 studies per phenotype. Here we discuss challenges in this undertaking and how they were addressed. The harmonized phenotype data and associated documentation have been submitted to National Institutes of Health data repositories for controlled access by the scientific community. We also provide materials to facilitate future harmonization efforts by the community, which include 1) the software code used to generate the 63 harmonized phenotypes, enabling others to reproduce, modify, or extend these harmonizations to additional studies, and 2) the results of labeling thousands of phenotype variables with controlled vocabulary terms.


Whole-genome sequencing association analysis of quantitative red blood cell phenotypes: The NHLBI TOPMed program.

  • Yao Hu‎ et al.
  • American journal of human genetics‎
  • 2021‎

Whole-genome sequencing (WGS), a powerful tool for detecting novel coding and non-coding disease-causing variants, has largely been applied to clinical diagnosis of inherited disorders. Here we leveraged WGS data in up to 62,653 ethnically diverse participants from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program and assessed statistical association of variants with seven red blood cell (RBC) quantitative traits. We discovered 14 single variant-RBC trait associations at 12 genomic loci, which have not been reported previously. Several of the RBC trait-variant associations (RPN1, ELL2, MIDN, HBB, HBA1, PIEZO1, and G6PD) were replicated in independent GWAS datasets imputed to the TOPMed reference panel. Most of these discovered variants are rare/low frequency, and several are observed disproportionately among non-European Ancestry (African, Hispanic/Latino, or East Asian) populations. We identified a 3 bp indel p.Lys2169del (g.88717175_88717177TCT[4]) (common only in the Ashkenazi Jewish population) of PIEZO1, a gene responsible for the Mendelian red cell disorder hereditary xerocytosis (MIM: 194380), associated with higher mean corpuscular hemoglobin concentration (MCHC). In stepwise conditional analysis and in gene-based rare variant aggregated association analysis, we identified several of the variants in HBB, HBA1, TMPRSS6, and G6PD that represent the carrier state for known coding, promoter, or splice site loss-of-function variants that cause inherited RBC disorders. Finally, we applied base and nuclease editing to demonstrate that the sentinel variant rs112097551 (nearest gene RPN1) acts through a cis-regulatory element that exerts long-range control of the gene RUVBL1 which is essential for hematopoiesis. Together, these results demonstrate the utility of WGS in ethnically diverse population-based samples and gene editing for expanding knowledge of the genetic architecture of quantitative hematologic traits and suggest a continuum between complex trait and Mendelian red cell disorders.


LINE-1 transcription in round spermatids is associated with accretion of 5-carboxylcytosine in their open reading frames.

  • Martin J Blythe‎ et al.
  • Communications biology‎
  • 2021‎

Chromatin of male and female gametes undergoes a number of reprogramming events during the transition from germ cell to embryonic developmental programs. Although the rearrangement of DNA methylation patterns occurring in the zygote has been extensively characterized, little is known about the dynamics of DNA modifications during spermatid maturation. Here, we demonstrate that the dynamics of 5-carboxylcytosine (5caC) correlate with active transcription of LINE-1 retroelements during murine spermiogenesis. We show that the open reading frames of active and evolutionary young LINE-1s are 5caC-enriched in round spermatids and 5caC is eliminated from LINE-1s and spermiogenesis-specific genes during spermatid maturation, being simultaneously retained at promoters and introns of developmental genes. Our results reveal an association of 5caC with activity of LINE-1 retrotransposons suggesting a potential direct role for this DNA modification in fine regulation of their transcription.


NANOG is required to establish the competence for germ-layer differentiation in the basal tetrapod axolotl.

  • Luke A Simpson‎ et al.
  • PLoS biology‎
  • 2023‎

Pluripotency defines the unlimited potential of individual cells of vertebrate embryos, from which all adult somatic cells and germ cells are derived. Understanding how the programming of pluripotency evolved has been obscured in part by a lack of data from lower vertebrates; in model systems such as frogs and zebrafish, the function of the pluripotency genes NANOG and POU5F1 have diverged. Here, we investigated how the axolotl ortholog of NANOG programs pluripotency during development. Axolotl NANOG is absolutely required for gastrulation and germ-layer commitment. We show that in axolotl primitive ectoderm (animal caps; ACs) NANOG and NODAL activity, as well as the epigenetic modifying enzyme DPY30, are required for the mass deposition of H3K4me3 in pluripotent chromatin. We also demonstrate that all 3 protein activities are required for ACs to establish the competency to differentiate toward mesoderm. Our results suggest the ancient function of NANOG may be establishing the competence for lineage differentiation in early cells. These observations provide insights into embryonic development in the tetrapod ancestor from which terrestrial vertebrates evolved.


  1. SciCrunch.org Resources

    Welcome to the FDI Lab - SciCrunch.org Resources search. From here you can search through a compilation of resources used by FDI Lab - SciCrunch.org and see how data is organized within our community.

  2. Navigation

    You are currently on the Community Resources tab looking through categories and sources that FDI Lab - SciCrunch.org has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.

  3. Logging in and Registering

    If you have an account on FDI Lab - SciCrunch.org then you can log in from here to get additional features in FDI Lab - SciCrunch.org such as Collections, Saved Searches, and managing Resources.

  4. Searching

    Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:

    1. Use quotes around phrases you want to match exactly
    2. You can manually AND and OR terms to change how we search between words
    3. You can add "-" to terms to make sure no results return with that term in them (ex. Cerebellum -CA1)
    4. You can add "+" to terms to require they be in the data
    5. Using autocomplete specifies which branch of our semantics you with to search and can help refine your search
  5. Save Your Search

    You can save any searches you perform for quick access to later from here.

  6. Query Expansion

    We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.

  7. Collections

    If you are logged into FDI Lab - SciCrunch.org you can add data records to your collections to create custom spreadsheets across multiple sources of data.

  8. Facets

    Here are the facets that you can filter your papers by.

  9. Options

    From here we'll present any options for the literature, such as exporting your current results.

  10. Further Questions

    If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.

Publications Per Year

X

Year:

Count: