Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

This service exclusively searches for literature that cites resources. Please be aware that the total number of searchable documents is limited to those containing RRIDs and does not include all open-access literature.

Search

Type in a keyword to search

On page 1 showing 1 ~ 20 papers out of 95 papers

Data for Genetic Analysis Workshop 18: human whole genome sequence, blood pressure, and simulated phenotypes in extended pedigrees.

  • Laura Almasy‎ et al.
  • BMC proceedings‎
  • 2014‎

Genetic Analysis Workshop 18 (GAW18) focused on identification of genes and functional variants that influence complex phenotypes in human sequence data. Data for the workshop were donated by the T2D-GENES Consortium and included whole genome sequences for odd-numbered autosomes in 464 key individuals selected from 20 Mexican American families, a dense set of single-nucleotide polymorphisms in 959 individuals in these families, and longitudinal data on systolic and diastolic blood pressure measured at 1-4 examinations over a period of 20 years. Simulated phenotypes were generated based on the real sequence data and pedigree structures. In the design of the simulation model, gene expression measures from the San Antonio Family Heart Study (not distributed as part of the GAW18 data) were used to identify genes whose mRNA levels were correlated with blood pressure. Observed variants within these genes were designated as functional in the GAW18 simulation if they were nonsynonymous and predicted to have deleterious effects on protein function or if they were noncoding and associated with mRNA levels. Two simulated longitudinal phenotypes were modeled to have the same trait distributions as the real systolic and diastolic blood pressure data, with effects of age, sex, and medication use, including a genotype-medication interaction. For each phenotype, more than 1000 sequence variants in more than 200 genes present on the odd-numbered autosomes individually explained less than 0.01-2.78% of phenotypic variance. Cumulatively, variants in the most influential gene explained 7.79% of trait variance. An additional simulated phenotype, Q1, was designed to be correlated among family members but to not be associated with any sequence variants. Two hundred replicates of the phenotypes were simulated, with each including data for 849 individuals.


Omics-squared: human genomic, transcriptomic and phenotypic data for genetic analysis workshop 19.

  • John Blangero‎ et al.
  • BMC proceedings‎
  • 2016‎

The Genetic Analysis Workshops (GAW) are a forum for development, testing, and comparison of statistical genetic methods and software. Each contribution to the workshop includes an application to a specified data set. Here we describe the data distributed for GAW19, which focused on analysis of human genomic and transcriptomic data.


Endogenous factor VIII synthesis from the intron 22-inverted F8 locus may modulate the immunogenicity of replacement therapy for hemophilia A.

  • Gouri Shankar Pandey‎ et al.
  • Nature medicine‎
  • 2013‎

Neutralizing antibodies (inhibitors) to replacement factor VIII (FVIII, either plasma derived or recombinant) impair the effective management of hemophilia A. Individuals with hemophilia A due to major deletions of the FVIII gene (F8) lack antigenically cross-reactive material in their plasma ("CRM-negative"), and the prevalence of inhibitors in these individuals may be as high as 90%. Conversely, individuals with hemophilia A caused by F8 missense mutations are CRM-positive, and their overall prevalence of inhibitors is <10% (ref. 2). Individuals with the F8 intron 22 inversion (found in ∼50% of individuals with severe hemophilia A) have been grouped with the former on the basis of their genetic defect and CRM-negative status. However, only ∼20% of these individuals develop inhibitors. Here we demonstrate that the levels of F8 mRNA and intracellular FVIII protein in B lymphoblastoid cells and liver biopsies from individuals with the intron 22 inversion are comparable to those in healthy controls. These results support the hypothesis that most individuals with the intron 22 inversion are tolerized to FVIII and thus do not develop inhibitors. Furthermore, we developed a new pharmacogenetic algorithm that permits the stratification of inhibitor risk for individuals and subpopulations by predicting the immunogenicity of replacement FVIII using, as input, the number of putative T cell epitopes in the infused protein and the competence of major histocompatibility complex class II molecules to present such epitopes. This algorithm showed statistically significant accuracy in predicting the presence of inhibitors in 25 unrelated individuals with the intron 22 inversion.


The genetic architecture of type 2 diabetes.

  • Christian Fuchsberger‎ et al.
  • Nature‎
  • 2016‎

The genetic architecture of common traits, including the number, frequency, and effect sizes of inherited variants that contribute to individual risk, has been long debated. Genome-wide association studies have identified scores of common variants associated with type 2 diabetes, but in aggregate, these explain only a fraction of the heritability of this disease. Here, to test the hypothesis that lower-frequency variants explain much of the remainder, the GoT2D and T2D-GENES consortia performed whole-genome sequencing in 2,657 European individuals with and without diabetes, and exome sequencing in 12,940 individuals from five ancestry groups. To increase statistical power, we expanded the sample size via genotyping and imputation in a further 111,548 subjects. Variants associated with type 2 diabetes after sequencing were overwhelmingly common and most fell within regions previously identified by genome-wide association studies. Comprehensive enumeration of sequence variation is necessary to identify functional alleles that provide important clues to disease pathophysiology, but large-scale sequencing does not support the idea that lower-frequency variants have a major role in predisposition to type 2 diabetes.


GWAS and transcriptional analysis prioritize ITPR1 and CNTN4 for a serum uric acid 3p26 QTL in Mexican Americans.

  • Geetha Chittoor‎ et al.
  • BMC genomics‎
  • 2016‎

The variation in serum uric acid concentrations is under significant genetic influence. Elevated SUA concentrations have been linked to increased risk for gout, kidney stones, chronic kidney disease, and cardiovascular disease whereas reduced serum uric acid concentrations have been linked to multiple sclerosis, Parkinson's disease and Alzheimer's disease. Previously, we identified a novel locus on chromosome 3p26 affecting serum uric acid concentrations in Mexican Americans from San Antonio Family Heart Study. As a follow up, we examined genome-wide single nucleotide polymorphism data in an extended cohort of 1281 Mexican Americans from multigenerational families of the San Antonio Family Heart Study and the San Antonio Family Diabetes/Gallbladder Study. We used a linear regression-based joint linkage/association test under an additive model of allelic effect, while accounting for non-independence among family members via a kinship variance component.


Large scale mitochondrial sequencing in Mexican Americans suggests a reappraisal of Native American origins.

  • Satish Kumar‎ et al.
  • BMC evolutionary biology‎
  • 2011‎

The Asian origin of Native Americans is largely accepted. However uncertainties persist regarding the source population(s) within Asia, the divergence and arrival time(s) of the founder groups, the number of expansion events, and migration routes into the New World. mtDNA data, presented over the past two decades, have been used to suggest a single-migration model for which the Beringian land mass plays an important role.


Novel associations of nonstructural Loci with paraoxonase activity.

  • Ellen E Quillen‎ et al.
  • Journal of lipids‎
  • 2012‎

The high-density-lipoprotein-(HDL-) associated esterase paraoxonase 1 (PON1) is a likely contributor to the antioxidant and antiatherosclerotic capabilities of HDL. Two nonsynonymous mutations in the structural gene, PON1, have been associated with variation in activity levels, but substantial interindividual differences remain unexplained and are greatest for substrates other than the eponymous paraoxon. PON1 activity levels were measured for three substrates-organophosphate paraoxon, arylester phenyl acetate, and lactone dihydrocoumarin-in 767 Mexican American individuals from San Antonio, Texas. Genetic influences on activity levels for each substrate were evaluated by association with approximately one million single nucleotide polymorphism (SNPs) while conditioning on PON1 genotypes. Significant associations were detected at five loci including regions on chromosomes 4 and 17 known to be associated with atherosclerosis and lipoprotein regulation and loci on chromosome 3 that regulate ubiquitous transcription factors. These loci explain 7.8% of variation in PON1 activity with lactone as a substrate, 5.6% with the arylester, and 3.0% with paraoxon. In light of the potential importance of PON1 in preventing cardiovascular disease/events, these novel loci merit further investigation.


A chromosome 11q quantitative-trait locus influences change of blood-pressure measurements over time in Mexican Americans of the San Antonio Family Heart Study.

  • Sue Rutherford‎ et al.
  • American journal of human genetics‎
  • 2007‎

Although previous genome scans have searched for quantitative-trait loci (QTLs) influencing variation in blood pressure (BP), few have investigated the rate of change in BP over time as a phenotype. Here, we compare results from genomewide scans to localize QTLs for systolic, diastolic, and mean arterial BPs (SBP, DBP, and MBP, respectively) and for rates of change in systolic, diastolic, and mean arterial BPs (rSBP, rDBP, and rMBP, respectively), with use of the longitudinal data collected about Mexican Americans of the San Antonio Family Heart Study (SAFHS). Significant evidence of linkage was found for rSBP (LOD 4.15) and for rMBP (LOD 3.94) near marker D11S4464 located on chromosome 11q24.1. This same chromosome 11q region also shows suggestive linkage to SBP (LOD 2.23) and MBP (LOD 2.37) measurements collected during the second clinic visit. Suggestive evidence of linkage to chromosome 5 was also found for rMBP, to chromosome 16 for rSBP, and to chromosomes 1, 5, 6, 7, and 21 for the single-time-point BP traits collected at the first two SAFHS clinic visits. We also present results from fine mapping the chromosome 11 QTL with use of SNP-association analysis within candidate genes identified from a bioinformatic search of the region and from whole-genome transcriptional expression data collected from 1,240 SAFHS participants. Our results show that the use of longitudinal BP data to calculate the rate of change in BP over time provides more information than do the single-time measurements, since they reveal physiological trends in the subjects that a single-time measurement could never capture. Further investigation of this region is necessary for the identification of the genetic variation responsible for QTLs influencing the rate of change in BP.


A Low-Frequency Inactivating AKT2 Variant Enriched in the Finnish Population Is Associated With Fasting Insulin Levels and Type 2 Diabetes Risk.

  • Alisa Manning‎ et al.
  • Diabetes‎
  • 2017‎

To identify novel coding association signals and facilitate characterization of mechanisms influencing glycemic traits and type 2 diabetes risk, we analyzed 109,215 variants derived from exome array genotyping together with an additional 390,225 variants from exome sequence in up to 39,339 normoglycemic individuals from five ancestry groups. We identified a novel association between the coding variant (p.Pro50Thr) in AKT2 and fasting plasma insulin (FI), a gene in which rare fully penetrant mutations are causal for monogenic glycemic disorders. The low-frequency allele is associated with a 12% increase in FI levels. This variant is present at 1.1% frequency in Finns but virtually absent in individuals from other ancestries. Carriers of the FI-increasing allele had increased 2-h insulin values, decreased insulin sensitivity, and increased risk of type 2 diabetes (odds ratio 1.05). In cellular studies, the AKT2-Thr50 protein exhibited a partial loss of function. We extend the allelic spectrum for coding variants in AKT2 associated with disorders of glucose homeostasis and demonstrate bidirectional effects of variants within the pleckstrin homology domain of AKT2.


Sequence data and association statistics from 12,940 type 2 diabetes cases and controls.

  • Jason Flannick‎ et al.
  • Scientific data‎
  • 2017‎

To investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1-5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D.


Further evidence supporting a potential role for ADH1B in obesity.

  • Liza D Morales‎ et al.
  • Scientific reports‎
  • 2021‎

Insulin is an essential hormone that regulates glucose homeostasis and metabolism. Insulin resistance (IR) arises when tissues fail to respond to insulin, and it leads to serious health problems including Type 2 Diabetes (T2D). Obesity is a major contributor to the development of IR and T2D. We previously showed that gene expression of alcohol dehydrogenase 1B (ADH1B) was inversely correlated with obesity and IR in subcutaneous adipose tissue of Mexican Americans. In the current study, a meta-analysis of the relationship between ADH1B expression and BMI in Mexican Americans, African Americans, Europeans, and Pima Indians verified that BMI was increased with decreased ADH1B expression. Using established human subcutaneous pre-adipocyte cell lines derived from lean (BMI < 30 kg m-2) or obese (BMI ≥ 30 kg m-2) donors, we found that ADH1B protein expression increased substantially during differentiation, and overexpression of ADH1B inhibited fatty acid binding protein expression. Mature adipocytes from lean donors expressed ADH1B at higher levels than obese donors. Insulin further induced ADH1B protein expression as well as enzyme activity. Knockdown of ADH1B expression decreased insulin-stimulated glucose uptake. Our findings suggest that ADH1B is involved in the proper development and metabolic activity of adipose tissues and this function is suppressed by obesity.


Associations of cannabis use disorder with cognition, brain structure, and brain function in African Americans.

  • Marinka M G Koenis‎ et al.
  • Human brain mapping‎
  • 2021‎

Although previous studies have highlighted associations of cannabis use with cognition and brain morphometry, critical questions remain with regard to the association between cannabis use and brain structural and functional connectivity. In a cross-sectional community sample of 205 African Americans (age 18-70) we tested for associations of cannabis use disorder (CUD, n = 57) with multi-domain cognitive measures and structural, diffusion, and resting state brain-imaging phenotypes. Post hoc model evidence was computed with Bayes factors (BF) and posterior probabilities of association (PPA) to account for multiple testing. General cognitive functioning, verbal intelligence, verbal memory, working memory, and motor speed were lower in the CUD group compared with non-users (p < .011; 1.9 < BF < 3,217). CUD was associated with altered functional connectivity in a network comprising the motor-hand region in the superior parietal gyri and the anterior insula (p < .04). These differences were not explained by alcohol, other drug use, or education. No associations with CUD were observed in cortical thickness, cortical surface area, subcortical or cerebellar volumes (0.12 < BF < 1.5), or graph-theoretical metrics of resting state connectivity (PPA < 0.01). In a large sample collected irrespective of cannabis used to minimize recruitment bias, we confirm the literature on poorer cognitive functioning in CUD, and an absence of volumetric brain differences between CUD and non-CUD. We did not find evidence for or against a disruption of structural connectivity, whereas we did find localized resting state functional dysconnectivity in CUD. There was sufficient proof, however, that organization of functional connectivity as determined via graph metrics does not differ between CUD and non-user group.


Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program.

  • Daniel Taliun‎ et al.
  • Nature‎
  • 2021‎

The Trans-Omics for Precision Medicine (TOPMed) programme seeks to elucidate the genetic architecture and biology of heart, lung, blood and sleep disorders, with the ultimate goal of improving diagnosis, treatment and prevention of these diseases. The initial phases of the programme focused on whole-genome sequencing of individuals with rich phenotypic data and diverse backgrounds. Here we describe the TOPMed goals and design as well as the available resources and early insights obtained from the sequence data. The resources include a variant browser, a genotype imputation server, and genomic and phenotypic data that are available through dbGaP (Database of Genotypes and Phenotypes)1. In the first 53,831 TOPMed samples, we detected more than 400 million single-nucleotide and insertion or deletion variants after alignment with the reference genome. Additional previously undescribed variants were detected through assembly of unmapped reads and customized analysis in highly variable loci. Among the more than 400 million detected variants, 97% have frequencies of less than 1% and 46% are singletons that are present in only one individual (53% among unrelated individuals). These rare variants provide insights into mutational processes and recent human evolutionary history. The extensive catalogue of genetic variation in TOPMed studies provides unique opportunities for exploring the contributions of rare and noncoding sequence variants to phenotypic variation. Furthermore, combining TOPMed haplotypes with modern imputation methods improves the power and reach of genome-wide association studies to include variants down to a frequency of approximately 0.01%.


Impact of Rare and Common Genetic Variants on Diabetes Diagnosis by Hemoglobin A1c in Multi-Ancestry Cohorts: The Trans-Omics for Precision Medicine Program.

  • Chloé Sarnowski‎ et al.
  • American journal of human genetics‎
  • 2019‎

Hemoglobin A1c (HbA1c) is widely used to diagnose diabetes and assess glycemic control in individuals with diabetes. However, nonglycemic determinants, including genetic variation, may influence how accurately HbA1c reflects underlying glycemia. Analyzing the NHLBI Trans-Omics for Precision Medicine (TOPMed) sequence data in 10,338 individuals from five studies and four ancestries (6,158 Europeans, 3,123 African-Americans, 650 Hispanics, and 407 East Asians), we confirmed five regions associated with HbA1c (GCK in Europeans and African-Americans, HK1 in Europeans and Hispanics, FN3K and/or FN3KRP in Europeans, and G6PD in African-Americans and Hispanics) and we identified an African-ancestry-specific low-frequency variant (rs1039215 in HBG2 and HBE1, minor allele frequency (MAF) = 0.03). The most associated G6PD variant (rs1050828-T, p.Val98Met, MAF = 12% in African-Americans, MAF = 2% in Hispanics) lowered HbA1c (-0.88% in hemizygous males, -0.34% in heterozygous females) and explained 23% of HbA1c variance in African-Americans and 4% in Hispanics. Additionally, we identified a rare distinct G6PD coding variant (rs76723693, p.Leu353Pro, MAF = 0.5%; -0.98% in hemizygous males, -0.46% in heterozygous females) and detected significant association with HbA1c when aggregating rare missense variants in G6PD. We observed similar magnitude and direction of effects for rs1039215 (HBG2) and rs76723693 (G6PD) in the two largest TOPMed African American cohorts, and we replicated the rs76723693 association in the UK Biobank African-ancestry participants. These variants in G6PD and HBG2 were monomorphic in the European and Asian samples. African or Hispanic ancestry individuals carrying G6PD variants may be underdiagnosed for diabetes when screened with HbA1c. Thus, assessment of these variants should be considered for incorporation into precision medicine approaches for diabetes diagnosis.


Chromosome Xq23 is associated with lower atherogenic lipid concentrations and favorable cardiometabolic indices.

  • Pradeep Natarajan‎ et al.
  • Nature communications‎
  • 2021‎

Autosomal genetic analyses of blood lipids have yielded key insights for coronary heart disease (CHD). However, X chromosome genetic variation is understudied for blood lipids in large sample sizes. We now analyze genetic and blood lipid data in a high-coverage whole X chromosome sequencing study of 65,322 multi-ancestry participants and perform replication among 456,893 European participants. Common alleles on chromosome Xq23 are strongly associated with reduced total cholesterol, LDL cholesterol, and triglycerides (min P = 8.5 × 10-72), with similar effects for males and females. Chromosome Xq23 lipid-lowering alleles are associated with reduced odds for CHD among 42,545 cases and 591,247 controls (P = 1.7 × 10-4), and reduced odds for diabetes mellitus type 2 among 54,095 cases and 573,885 controls (P = 1.4 × 10-5). Although we observe an association with increased BMI, waist-to-hip ratio adjusted for BMI is reduced, bioimpedance analyses indicate increased gluteofemoral fat, and abdominal MRI analyses indicate reduced visceral adiposity. Co-localization analyses strongly correlate increased CHRDL1 gene expression, particularly in adipose tissue, with reduced concentrations of blood lipids.


1q21.1 distal copy number variants are associated with cerebral and cognitive alterations in humans.

  • Ida E Sønderby‎ et al.
  • Translational psychiatry‎
  • 2021‎

Low-frequency 1q21.1 distal deletion and duplication copy number variant (CNV) carriers are predisposed to multiple neurodevelopmental disorders, including schizophrenia, autism and intellectual disability. Human carriers display a high prevalence of micro- and macrocephaly in deletion and duplication carriers, respectively. The underlying brain structural diversity remains largely unknown. We systematically called CNVs in 38 cohorts from the large-scale ENIGMA-CNV collaboration and the UK Biobank and identified 28 1q21.1 distal deletion and 22 duplication carriers and 37,088 non-carriers (48% male) derived from 15 distinct magnetic resonance imaging scanner sites. With standardized methods, we compared subcortical and cortical brain measures (all) and cognitive performance (UK Biobank only) between carrier groups also testing for mediation of brain structure on cognition. We identified positive dosage effects of copy number on intracranial volume (ICV) and total cortical surface area, with the largest effects in frontal and cingulate cortices, and negative dosage effects on caudate and hippocampal volumes. The carriers displayed distinct cognitive deficit profiles in cognitive tasks from the UK Biobank with intermediate decreases in duplication carriers and somewhat larger in deletion carriers-the latter potentially mediated by ICV or cortical surface area. These results shed light on pathobiological mechanisms of neurodevelopmental disorders, by demonstrating gene dose effect on specific brain structures and effect on cognitive function.


Whole-genome sequencing association analysis of quantitative red blood cell phenotypes: The NHLBI TOPMed program.

  • Yao Hu‎ et al.
  • American journal of human genetics‎
  • 2021‎

Whole-genome sequencing (WGS), a powerful tool for detecting novel coding and non-coding disease-causing variants, has largely been applied to clinical diagnosis of inherited disorders. Here we leveraged WGS data in up to 62,653 ethnically diverse participants from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program and assessed statistical association of variants with seven red blood cell (RBC) quantitative traits. We discovered 14 single variant-RBC trait associations at 12 genomic loci, which have not been reported previously. Several of the RBC trait-variant associations (RPN1, ELL2, MIDN, HBB, HBA1, PIEZO1, and G6PD) were replicated in independent GWAS datasets imputed to the TOPMed reference panel. Most of these discovered variants are rare/low frequency, and several are observed disproportionately among non-European Ancestry (African, Hispanic/Latino, or East Asian) populations. We identified a 3 bp indel p.Lys2169del (g.88717175_88717177TCT[4]) (common only in the Ashkenazi Jewish population) of PIEZO1, a gene responsible for the Mendelian red cell disorder hereditary xerocytosis (MIM: 194380), associated with higher mean corpuscular hemoglobin concentration (MCHC). In stepwise conditional analysis and in gene-based rare variant aggregated association analysis, we identified several of the variants in HBB, HBA1, TMPRSS6, and G6PD that represent the carrier state for known coding, promoter, or splice site loss-of-function variants that cause inherited RBC disorders. Finally, we applied base and nuclease editing to demonstrate that the sentinel variant rs112097551 (nearest gene RPN1) acts through a cis-regulatory element that exerts long-range control of the gene RUVBL1 which is essential for hematopoiesis. Together, these results demonstrate the utility of WGS in ethnically diverse population-based samples and gene editing for expanding knowledge of the genetic architecture of quantitative hematologic traits and suggest a continuum between complex trait and Mendelian red cell disorders.


APOC3 genetic variation, serum triglycerides, and risk of coronary artery disease in Asian Indians, Europeans, and other ethnic groups.

  • Shiwali Goyal‎ et al.
  • Lipids in health and disease‎
  • 2021‎

Hypertriglyceridemia has emerged as a critical coronary artery disease (CAD) risk factor. Rare loss-of-function (LoF) variants in apolipoprotein C-III have been reported to reduce triglycerides (TG) and are cardioprotective in American Indians and Europeans. However, there is a lack of data in other Europeans and non-Europeans. Also, whether genetically increased plasma TG due to ApoC-III is causally associated with increased CAD risk is still unclear and inconsistent. The objectives of this study were to verify the cardioprotective role of earlier reported six LoF variants of APOC3 in South Asians and other multi-ethnic cohorts and to evaluate the causal association of TG raising common variants for increasing CAD risk.


Rare coding variants in RCN3 are associated with blood pressure.

  • Karen Y He‎ et al.
  • BMC genomics‎
  • 2022‎

While large genome-wide association studies have identified nearly one thousand loci associated with variation in blood pressure, rare variant identification is still a challenge. In family-based cohorts, genome-wide linkage scans have been successful in identifying rare genetic variants for blood pressure. This study aims to identify low frequency and rare genetic variants within previously reported linkage regions on chromosomes 1 and 19 in African American families from the Trans-Omics for Precision Medicine (TOPMed) program. Genetic association analyses weighted by linkage evidence were completed with whole genome sequencing data within and across TOPMed ancestral groups consisting of 60,388 individuals of European, African, East Asian, Hispanic, and Samoan ancestries.


Structural variation across 138,134 samples in the TOPMed consortium.

  • Goo Jun‎ et al.
  • bioRxiv : the preprint server for biology‎
  • 2023‎

Ever larger Structural Variant (SV) catalogs highlighting the diversity within and between populations help researchers better understand the links between SVs and disease. The identification of SVs from DNA sequence data is non-trivial and requires a balance between comprehensiveness and precision. Here we present a catalog of 355,667 SVs (59.34% novel) across autosomes and the X chromosome (50bp+) from 138,134 individuals in the diverse TOPMed consortium. We describe our methodologies for SV inference resulting in high variant quality and >90% allele concordance compared to long-read de-novo assemblies of well-characterized control samples. We demonstrate utility through significant associations between SVs and important various cardio-metabolic and hemotologic traits. We have identified 690 SV hotspots and deserts and those that potentially impact the regulation of medically relevant genes. This catalog characterizes SVs across multiple populations and will serve as a valuable tool to understand the impact of SV on disease development and progression.


  1. SciCrunch.org Resources

    Welcome to the FDI Lab - SciCrunch.org Resources search. From here you can search through a compilation of resources used by FDI Lab - SciCrunch.org and see how data is organized within our community.

  2. Navigation

    You are currently on the Community Resources tab looking through categories and sources that FDI Lab - SciCrunch.org has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.

  3. Logging in and Registering

    If you have an account on FDI Lab - SciCrunch.org then you can log in from here to get additional features in FDI Lab - SciCrunch.org such as Collections, Saved Searches, and managing Resources.

  4. Searching

    Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:

    1. Use quotes around phrases you want to match exactly
    2. You can manually AND and OR terms to change how we search between words
    3. You can add "-" to terms to make sure no results return with that term in them (ex. Cerebellum -CA1)
    4. You can add "+" to terms to require they be in the data
    5. Using autocomplete specifies which branch of our semantics you with to search and can help refine your search
  5. Save Your Search

    You can save any searches you perform for quick access to later from here.

  6. Query Expansion

    We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.

  7. Collections

    If you are logged into FDI Lab - SciCrunch.org you can add data records to your collections to create custom spreadsheets across multiple sources of data.

  8. Facets

    Here are the facets that you can filter your papers by.

  9. Options

    From here we'll present any options for the literature, such as exporting your current results.

  10. Further Questions

    If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.

Publications Per Year

X

Year:

Count: