This service exclusively searches for literature that cites resources. Please be aware that the total number of searchable documents is limited to those containing RRIDs and does not include all open-access literature.
The ever-increasing number of sequenced and annotated genomes has made management of their annotations a significant undertaking, especially for large eukaryotic genomes containing many thousands of genes. Typically, changes in gene and transcript numbers are used to summarize changes from release to release, but these measures say nothing about changes to individual annotations, nor do they provide any means to identify annotations in need of manual review.
High-throughput genotyping data are useful for making inferences about human evolutionary history. However, the populations sampled to date are unevenly distributed, and some areas (e.g., South and Central Asia) have rarely been sampled in large-scale studies. To assess human genetic variation more evenly, we sampled 296 individuals from 13 worldwide populations that are not covered by previous studies. By combining these samples with a data set from our laboratory and the HapMap II samples, we assembled a final dataset of ~250,000 SNPs in 850 individuals from 40 populations. With more uniform sampling, the estimate of global genetic differentiation (F(ST)) substantially decreases from ~16% with the HapMap II samples to ~11%. A panel of copy number variations typed in the same populations shows patterns of diversity similar to the SNP data, with highest diversity in African populations. This unique sample collection also permits new inferences about human evolutionary history. The comparison of haplotype variation among populations supports a single out-of-Africa migration event and suggests that the founding population of Eurasia may have been relatively large but isolated from Africans for a period of time. We also found a substantial affinity between populations from central Asia (Kyrgyzstani and Mongolian Buryat) and America, suggesting a central Asian contribution to New World founder populations.
Adaptation requires genetic variation, but founder populations are generally genetically depleted. Here we sequence two populations of an inbred ant that diverge in phenotype to determine how variability is generated. Cardiocondyla obscurior has the smallest of the sequenced ant genomes and its structure suggests a fundamental role of transposable elements (TEs) in adaptive evolution. Accumulations of TEs (TE islands) comprising 7.18% of the genome evolve faster than other regions with regard to single-nucleotide variants, gene/exon duplications and deletions and gene homology. A non-random distribution of gene families, larvae/adult specific gene expression and signs of differential methylation in TE islands indicate intragenomic differences in regulation, evolutionary rates and coalescent effective population size. Our study reveals a tripartite interplay between TEs, life history and adaptation in an invasive species.
Gibbons are small arboreal apes that display an accelerated rate of evolutionary chromosomal rearrangement and occupy a key node in the primate phylogeny between Old World monkeys and great apes. Here we present the assembly and analysis of a northern white-cheeked gibbon (Nomascus leucogenys) genome. We describe the propensity for a gibbon-specific retrotransposon (LAVA) to insert into chromosome segregation genes and alter transcription by providing a premature termination site, suggesting a possible molecular mechanism for the genome plasticity of the gibbon lineage. We further show that the gibbon genera (Nomascus, Hylobates, Hoolock and Symphalangus) experienced a near-instantaneous radiation ∼5 million years ago, coincident with major geographical changes in southeast Asia that caused cycles of habitat compression and expansion. Finally, we identify signatures of positive selection in genes important for forelimb development (TBX5) and connective tissues (COL1A1) that may have been involved in the adaptation of gibbons to their arboreal habitat.
TBX3 is a member of the T-box family of transcription factors with critical roles in development, oncogenesis, cell fate, and tissue homeostasis. TBX3 mutations in humans cause complex congenital malformations and Ulnar-mammary syndrome. Previous investigations into TBX3 function focused on its activity as a transcriptional repressor. We used an unbiased proteomic approach to identify TBX3 interacting proteins in vivo and discovered that TBX3 interacts with multiple mRNA splicing factors and RNA metabolic proteins. We discovered that TBX3 regulates alternative splicing in vivo and can promote or inhibit splicing depending on context and transcript. TBX3 associates with alternatively spliced mRNAs and binds RNA directly. TBX3 binds RNAs containing TBX binding motifs, and these motifs are required for regulation of splicing. Our study reveals that TBX3 mutations seen in humans with UMS disrupt its splicing regulatory function. The pleiotropic effects of TBX3 mutations in humans and mice likely result from disrupting at least two molecular functions of this protein: transcriptional regulation and pre-mRNA splicing.
Deedu (DU) Mongolians, who migrated from the Mongolian steppes to the Qinghai-Tibetan Plateau approximately 500 years ago, are challenged by environmental conditions similar to those faced by native Tibetan highlanders. Identification of adaptive genetic factors in this population could provide insight into coordinated physiological responses to this environment. Here we examine genomic and phenotypic variation in this unique population and present the first complete analysis of a Mongolian whole-genome sequence. High-density SNP array data demonstrate that DU Mongolians share genetic ancestry with other Mongolian as well as Tibetan populations, specifically in genomic regions related to adaptation to high altitude. Several selection candidate genes identified in DU Mongolians are shared with other Asian groups (e.g., EDAR), neighboring Tibetan populations (including high-altitude candidates EPAS1, PKLR, and CYP2E1), as well as genes previously hypothesized to be associated with metabolic adaptation (e.g., PPARG). Hemoglobin concentration, a trait associated with high-altitude adaptation in Tibetans, is at an intermediate level in DU Mongolians compared to Tibetans and Han Chinese at comparable altitude. Whole-genome sequence from a DU Mongolian (Tianjiao1) shows that about 2% of the genomic variants, including more than 300 protein-coding changes, are specific to this individual. Our analyses of DU Mongolians and the first Mongolian genome provide valuable insight into genetic adaptation to extreme environments.
Second-generation sequencing technologies are precipitating major shifts with regard to what kinds of genomes are being sequenced and how they are annotated. While the first generation of genome projects focused on well-studied model organisms, many of today's projects involve exotic organisms whose genomes are largely terra incognita. This complicates their annotation, because unlike first-generation projects, there are no pre-existing 'gold-standard' gene-models with which to train gene-finders. Improvements in genome assembly and the wide availability of mRNA-seq data are also creating opportunities to update and re-annotate previously published genome annotations. Today's genome projects are thus in need of new genome annotation tools that can meet the challenges and opportunities presented by second-generation sequencing technologies.
The sea lamprey (Petromyzon marinus) serves as a comparative model for reconstructing vertebrate evolution. To enable more informed analyses, we developed a new assembly of the lamprey germline genome that integrates several complementary data sets. Analysis of this highly contiguous (chromosome-scale) assembly shows that both chromosomal and whole-genome duplications have played significant roles in the evolution of ancestral vertebrate and lamprey genomes, including chromosomes that carry the six lamprey HOX clusters. The assembly also contains several hundred genes that are reproducibly eliminated from somatic cells during early development in lamprey. Comparative analyses show that gnathostome (mouse) homologs of these genes are frequently marked by polycomb repressive complexes (PRCs) in embryonic stem cells, suggesting overlaps in the regulatory logic of somatic DNA elimination and bivalent states that are regulated by early embryonic PRCs. This new assembly will enhance diverse studies that are informed by lampreys' unique biology and evolutionary/comparative perspective.
The genus Conus comprises approximately 700 species of venomous marine cone snails that are highly efficient predators of worms, snails, and fish. In evolutionary terms, cone snails are relatively young with the earliest fossil records occurring in the Lower Eocene, 55 Ma. The rapid radiation of cone snail species has been accompanied by remarkably high rates of toxin diversification. To shed light on the molecular mechanisms that accompany speciation, we investigated the toxin repertoire of two sister species, Conus andremenezi and Conus praecellens, that were until recently considered a single variable species. A total of 196 and 250 toxin sequences were identified in the venom gland transcriptomes of C. andremenezi and C. praecellens belonging to 25 and 29 putative toxin gene superfamilies, respectively. Comparative analysis with closely (Conus tribblei and Conus lenavati) and more distantly related species (Conus geographus) suggests that speciation is associated with significant diversification of individual toxin genes (exogenes) whereas the expression pattern of toxin gene superfamilies within lineages remains largely conserved. Thus, changes within individual toxin sequences can serve as a sensitive indicator for recent speciation whereas changes in the expression pattern of gene superfamilies are likely to reflect more dramatic differences in a species' interaction with its prey, predators, and competitors.
The diaphragm is critical for respiration and separation of the thoracic and abdominal cavities, and defects in diaphragm development are the cause of congenital diaphragmatic hernias (CDH), a common and often lethal birth defect. The genetic etiology of CDH is complex. Single-nucleotide variants (SNVs), insertions/deletions (indels), and structural variants (SVs) in more than 150 genes have been associated with CDH, although few genes are recurrently mutated in multiple individuals and mutated genes are incompletely penetrant. This suggests that multiple genetic variants in combination, other not-yet-investigated classes of variants, and/or nongenetic factors contribute to CDH etiology. However, no studies have comprehensively investigated in affected individuals the contribution of all possible classes of variants throughout the genome to CDH etiology. In our study, we used a unique cohort of four individuals with isolated CDH with samples from blood, skin, and diaphragm connective tissue and parental blood and deep whole-genome sequencing to assess germline and somatic de novo and inherited SNVs, indels, and SVs. In each individual we found a different mutational landscape that included germline de novo and inherited SNVs and indels in multiple genes. We also found in two individuals a 343 bp deletion interrupting an annotated enhancer of the CDH-associated gene GATA4, and we hypothesize that this common SV (found in 1%-2% of the population) acts as a sensitizing allele for CDH. Overall, our comprehensive reconstruction of the genetic architecture of four CDH individuals demonstrates that the etiology of CDH is heterogeneous and multifactorial.
Prioritization of sequence variants for diagnosis and discovery of Mendelian diseases is challenging, especially in large collections of whole genome sequences (WGS). Fast, scalable solutions are needed for discovery research, for clinical applications, and for curation of massive public variant repositories such as dbSNP and gnomAD. In response, we have developed VVP, the VAAST Variant Prioritizer. VVP is ultrafast, scales to even the largest variant repositories and genome collections, and its outputs are designed to simplify clinical interpretation of variants of uncertain significance.
In contrast to the western honey bee, Apis mellifera, other honey bee species have been largely neglected despite their importance and diversity. The genetic basis of the evolutionary diversification of honey bees remains largely unknown. Here, we provide a genome-wide comparison of three honey bee species, each representing one of the three subgenera of honey bees, namely the dwarf (Apis florea), giant (A. dorsata), and cavity-nesting (A. mellifera) honey bees with bumblebees as an outgroup. Our analyses resolve the phylogeny of honey bees with the dwarf honey bees diverging first. We find that evolution of increased eusocial complexity in Apis proceeds via increases in the complexity of gene regulation, which is in agreement with previous studies. However, this process seems to be related to pathways other than transcriptional control. Positive selection patterns across Apis reveal a trade-off between maintaining genome stability and generating genetic diversity, with a rapidly evolving piRNA pathway leading to genomes depleted of transposable elements, and a rapidly evolving DNA repair pathway associated with high recombination rates in all Apis species. Diversification within Apis is accompanied by positive selection in several genes whose putative functions present candidate mechanisms for lineage-specific adaptations, such as migration, immunity, and nesting behavior.
Although many loci have been associated with height in European ancestry populations, very few have been identified in African ancestry individuals. Furthermore, many of the known loci have yet to be generalized to and fine-mapped within a large-scale African ancestry sample. We performed sex-combined and sex-stratified meta-analyses in up to 52,764 individuals with height and genome-wide genotyping data from the African Ancestry Anthropometry Genetics Consortium (AAAGC). We additionally combined our African ancestry meta-analysis results with published European genome-wide association study (GWAS) data. In the African ancestry analyses, we identified three novel loci (SLC4A3, NCOA2, ECD/FAM149B1) in sex-combined results and two loci (CRB1, KLF6) in women only. In the African plus European sex-combined GWAS, we identified an additional three novel loci (RCCD1, G6PC3, CEP95) which were equally driven by AAAGC and European results. Among 39 genome-wide significant signals at known loci, conditioning index SNPs from European studies identified 20 secondary signals. Two of the 20 new secondary signals and none of the 8 novel loci had minor allele frequencies (MAF) < 5%. Of 802 known European height signals, 643 displayed directionally consistent associations with height, of which 205 were nominally significant (p < 0.05) in the African ancestry sex-combined sample. Furthermore, 148 of 241 loci contained ≤20 variants in the credible sets that jointly account for 99% of the posterior probability of driving the associations. In summary, trans-ethnic meta-analyses revealed novel signals and further improved fine-mapping of putative causal variants in loci shared between African and European ancestry populations.
There is strong evidence for a genetic contribution to non-syndromic congenital heart defects (CHDs). However, exome- and genome-wide studies conducted at the variant and gene-level have identified few genome-wide significant CHD-related genes. Gene-set analyses are a useful complement to such studies and candidate gene-set analyses of rare variants have provided insight into the genetics of CHDs. However, similar analyses have not been conducted using data on common genetic variants. Consequently, we conducted common variant analyses of 15 CHD candidate gene-sets, using data from two common types of CHDs: conotruncal heart defects (1431 cases) and left ventricular outflow tract defects (509 cases). After Bonferroni correction for evaluation of multiple gene-sets, the cytoskeletal gene-set was significantly associated with conotruncal heart defects (βS = 0.09; 95% confidence interval (CI) 0.03-0.15). This association was stronger when analyses were restricted to the sub-set of cytoskeletal genes that have been observed to harbor rare damaging genotypes in at least two CHD cases (βS = 0.32, 95% CI 0.08-0.56). These findings add to the evidence linking cytoskeletal genes to CHDs and suggest that, for cytoskeletal genes, common variation may contribute to the risk of CHDs.
The genetic architecture of atrial fibrillation (AF) encompasses low impact, common genetic variants and high impact, rare variants. Here, we characterize a high impact AF-susceptibility allele, KCNQ1 R231H, and describe its transcontinental geographic distribution and history. Induced pluripotent stem cell-derived cardiomyocytes procured from risk allele carriers exhibit abbreviated action potential duration, consistent with a gain-of-function effect. Using identity-by-descent (IBD) networks, we estimate the broad- and fine-scale population ancestry of risk allele carriers and their relatives. Analysis of ancestral migration routes reveals ancestors who inhabited Denmark in the 1700s, migrated to the Northeastern United States in the early 1800s, and traveled across the Midwest to arrive in Utah in the late 1800s. IBD/coalescent-based allele dating analysis reveals a relatively recent origin of the AF risk allele (~5000 years). Thus, our approach broadens the scope of study for disease susceptibility alleles to the context of human migration and ancestral origins.
Endometriosis is a debilitating, chronic disease that is estimated to affect 11% of reproductive-age women. Diagnosis of endometriosis is difficult with diagnostic delays of up to 12 years reported. These delays can negatively impact health and quality of life. Vague, nonspecific symptoms, like pain, with multiple differential diagnoses contribute to the difficulty of diagnosis. By investigating previously imprecise symptoms of pain, we sought to clarify distinct pain symptoms indicative of endometriosis, using an artificial intelligence-based approach. We used data from 473 women undergoing laparoscopy or laparotomy for a variety of surgical indications. Multiple anatomical pain locations were clustered based on the associations across samples to increase the power in the probability calculations. A Bayesian network was developed using pain-related features, subfertility, and diagnoses. Univariable and multivariable analyses were performed by querying the network for the relative risk of a postoperative diagnosis, given the presence of different symptoms. Performance and sensitivity analyses demonstrated the advantages of Bayesian network analysis over traditional statistical techniques. Clustering grouped the 155 anatomical sites of pain into 15 pain locations. After pruning, the final Bayesian network included 18 nodes. The presence of any pain-related feature increased the relative risk of endometriosis (p-value < 0.001). The constellation of chronic pelvic pain, subfertility, and dyspareunia resulted in the greatest increase in the relative risk of endometriosis. The performance and sensitivity analyses demonstrated that the Bayesian network could identify and analyze more significant associations with endometriosis than traditional statistical techniques. Pelvic pain, frequently associated with endometriosis, is a common and vague symptom. Our Bayesian network for the study of pain-related features of endometriosis revealed specific pain locations and pain types that potentially forecast the diagnosis of endometriosis.
Nonsense-mediated messenger RNA (mRNA) decay (NMD) is an mRNA degradation pathway that regulates a significant portion of the transcriptome. The expression levels of numerous genes are known to be altered in NMD mutants, but it is not known which of these transcripts are direct pathway targets. Here, we present the first genome-wide analysis of direct NMD targeting in an intact animal. By using rapid reactivation of the NMD pathway in a Drosophila melanogaster NMD mutant and global monitoring of changes in mRNA expression levels, we can distinguish between primary and secondary effects of NMD on gene expression. Using this procedure, we identified 168 candidate direct NMD targets in vivo. Remarkably, we found that 81% of direct target genes do not show increased expression levels in an NMD mutant, presumably due to feedback regulation. Because most previous studies have used up-regulation of mRNA expression as the only means to identify NMD-regulated transcripts, our results provide new directions for understanding the roles of the NMD pathway in endogenous gene regulation during animal development and physiology. For instance, we show clearly that direct target genes have longer 3' untranslated regions compared with nontargets, suggesting long 3' untranslated regions target mRNAs for NMD in vivo. In addition, we investigated the role of NMD in suppressing transcriptional noise and found that although the transposable element Copia is up-regulated in NMD mutants, this effect appears to be indirect.