Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

This service exclusively searches for literature that cites resources. Please be aware that the total number of searchable documents is limited to those containing RRIDs and does not include all open-access literature.

Search

Type in a keyword to search

On page 1 showing 1 ~ 20 papers out of 36 papers

Long-read sequencing and de novo assembly of a Chinese genome.

  • Lingling Shi‎ et al.
  • Nature communications‎
  • 2016‎

Short-read sequencing has enabled the de novo assembly of several individual human genomes, but with inherent limitations in characterizing repeat elements. Here we sequence a Chinese individual HX1 by single-molecule real-time (SMRT) long-read sequencing, construct a physical map by NanoChannel arrays and generate a de novo assembly of 2.93 Gb (contig N50: 8.3 Mb, scaffold N50: 22.0 Mb, including 39.3 Mb N-bases), together with 206 Mb of alternative haplotypes. The assembly fully or partially fills 274 (28.4%) N-gaps in the reference genome GRCh38. Comparison to GRCh38 reveals 12.8 Mb of HX1-specific sequences, including 4.1 Mb that are not present in previously reported Asian genomes. Furthermore, long-read sequencing of the transcriptome reveals novel spliced genes that are not annotated in GENCODE and are missed by short-read RNA-Seq. Our results imply that improved characterization of genome functional variation may require the use of a range of genomic technologies on diverse human populations.


SeqHBase: a big data toolset for family based sequencing data analysis.

  • Min He‎ et al.
  • Journal of medical genetics‎
  • 2015‎

Whole-genome sequencing (WGS) and whole-exome sequencing (WES) technologies are increasingly used to identify disease-contributing mutations in human genomic studies. It can be a significant challenge to process such data, especially when a large family or cohort is sequenced. Our objective was to develop a big data toolset to efficiently manipulate genome-wide variants, functional annotations and coverage, together with conducting family based sequencing data analysis.


"Genotype-first" approaches on a curious case of idiopathic progressive cognitive decline.

  • Lingling Shi‎ et al.
  • BMC medical genomics‎
  • 2014‎

In developing countries, many cases with rare neurological diseases remain undiagnosed due to limited diagnostic experience. We encountered a case in China where two siblings both began to develop idiopathic progressive cognitive decline starting from age six, and were suspected to have an undiagnosed neurological disease.


Low concordance of multiple variant-calling pipelines: practical implications for exome and genome sequencing.

  • Jason O'Rawe‎ et al.
  • Genome medicine‎
  • 2013‎

To facilitate the clinical implementation of genomic medicine by next-generation sequencing, it will be critically important to obtain accurate and consistent variant calls on personal genomes. Multiple software tools for variant calling are available, but it is unclear how comparable these tools are or what their relative merits in real-world scenarios might be.


Identifying disease mutations in genomic medicine settings: current challenges and how to accelerate progress.

  • Gholson J Lyon‎ et al.
  • Genome medicine‎
  • 2012‎

The pace of exome and genome sequencing is accelerating, with the identification of many new disease-causing mutations in research settings, and it is likely that whole exome or genome sequencing could have a major impact in the clinical arena in the relatively near future. However, the human genomics community is currently facing several challenges, including phenotyping, sample collection, sequencing strategies, bioinformatics analysis, biological validation of variant function, clinical interpretation and validity of variant data, and delivery of genomic information to various constituents. Here we review these challenges and summarize the bottlenecks for the clinical application of exome and genome sequencing, and we discuss ways for moving the field forward. In particular, we urge the need for clinical-grade sample collection, high-quality sequencing data acquisition, digitalized phenotyping, rigorous generation of variant calls, and comprehensive functional annotation of variants. Additionally, we suggest that a 'networking of science' model that encourages much more collaboration and online sharing of medical history, genomic data and biological knowledge, including among research participants and consumers/patients, will help establish causation and penetrance for disease causal variants and genes. As we enter this new era of genomic medicine, we envision that consumer-driven and consumer-oriented efforts will take center stage, thus allowing insights from the human genome project to translate directly back into individualized medicine.


Scikit-ribo Enables Accurate Estimation and Robust Modeling of Translation Dynamics at Codon Resolution.

  • Han Fang‎ et al.
  • Cell systems‎
  • 2018‎

Ribosome profiling (Ribo-seq) is a powerful technique for measuring protein translation; however, sampling errors and biological biases are prevalent and poorly understood. Addressing these issues, we present Scikit-ribo (https://github.com/schatzlab/scikit-ribo), an open-source analysis package for accurate genome-wide A-site prediction and translation efficiency (TE) estimation from Ribo-seq and RNA sequencing data. Scikit-ribo accurately identifies A-site locations and reproduces codon elongation rates using several digestion protocols (r = 0.99). Next, we show that the commonly used reads per kilobase of transcript per million mapped reads-derived TE estimation is prone to biases, especially for low-abundance genes. Scikit-ribo introduces a codon-level generalized linear model with ridge penalty that correctly estimates TE, while accommodating variable codon elongation rates and mRNA secondary structure. This corrects the TE errors for over 2,000 genes in S. cerevisiae, which we validate using mass spectrometry of protein abundances (r = 0.81), and allows us to determine the Kozak-like sequence directly from Ribo-seq. We conclude with an analysis of coverage requirements needed for robust codon-level analysis and quantify the artifacts that can occur from cycloheximide treatment.


Whole genome sequencing of one complex pedigree illustrates challenges with genomic medicine.

  • Han Fang‎ et al.
  • BMC medical genomics‎
  • 2017‎

Human Phenotype Ontology (HPO) has risen as a useful tool for precision medicine by providing a standardized vocabulary of phenotypic abnormalities to describe presentations of human pathologies; however, there have been relatively few reports combining whole genome sequencing (WGS) and HPO, especially in the context of structural variants.


Naa12 compensates for Naa10 in mice in the amino-terminal acetylation pathway.

  • Hyae Yon Kweon‎ et al.
  • eLife‎
  • 2021‎

Amino-terminal acetylation is catalyzed by a set of N-terminal acetyltransferases (NATs). The NatA complex (including X-linked Naa10 and Naa15) is the major acetyltransferase, with 40-50% of all mammalian proteins being potential substrates. However, the overall role of amino-terminal acetylation on a whole-organism level is poorly understood, particularly in mammals. Male mice lacking Naa10 show no globally apparent in vivo amino-terminal acetylation impairment and do not exhibit complete embryonic lethality. Rather Naa10 nulls display increased neonatal lethality, and the majority of surviving undersized mutants exhibit a combination of hydrocephaly, cardiac defects, homeotic anterior transformation, piebaldism, and urogenital anomalies. Naa12 is a previously unannotated Naa10-like paralog with NAT activity that genetically compensates for Naa10. Mice deficient for Naa12 have no apparent phenotype, whereas mice deficient for Naa10 and Naa12 display embryonic lethality. The discovery of Naa12 adds to the currently known machinery involved in amino-terminal acetylation in mice.


Evaluating possible maternal effect lethality and genetic background effects in Naa10 knockout mice.

  • Gholson J Lyon‎ et al.
  • bioRxiv : the preprint server for biology‎
  • 2024‎

Amino-terminal (Nt-) acetylation (NTA) is a common protein modification, affecting approximately 80% of all human proteins. The human essential X-linked gene, NAA10, encodes for the enzyme NAA10, which is the catalytic subunit in the N-terminal acetyltransferase A (NatA) complex. There is extensive genetic variation in humans with missense, splice-site, and C-terminal frameshift variants in NAA10. In mice, Naa10 is not an essential gene, as there exists a paralogous gene, Naa12, that substantially rescues Naa10 knockout mice from embryonic lethality, whereas double knockouts (Naa10-/Y Naa12-/-) are embryonic lethal. However, the phenotypic variability in the mice is nonetheless quite extensive, including piebaldism, skeletal defects, small size, hydrocephaly, hydronephrosis, and neonatal lethality. Here we replicate these phenotypes with new genetic alleles in mice, but we demonstrate their modulation by genetic background and environmental effects. We cannot replicate a prior report of "maternal effect lethality" for heterozygous Naa10-/X female mice, but we do observe a small amount of embryonic lethality in the Naa10-/Y male mice on the inbred genetic background in this different animal facility.


An international effort towards developing standards for best practices in analysis, interpretation and reporting of clinical genome sequencing results in the CLARITY Challenge.

  • Catherine A Brownstein‎ et al.
  • Genome biology‎
  • 2014‎

There is tremendous potential for genome sequencing to improve clinical diagnosis and care once it becomes routinely accessible, but this will require formalizing research methods into clinical best practices in the areas of sequence data generation, analysis, interpretation and reporting. The CLARITY Challenge was designed to spur convergence in methods for diagnosing genetic disease starting from clinical case history and genome sequencing data. DNA samples were obtained from three families with heritable genetic disorders and genomic sequence data were donated by sequencing platform vendors. The challenge was to analyze and interpret these data with the goals of identifying disease-causing variants and reporting the findings in a clinically useful format. Participating contestant groups were solicited broadly, and an independent panel of judges evaluated their performance.


Accurate de novo and transmitted indel detection in exome-capture data using microassembly.

  • Giuseppe Narzisi‎ et al.
  • Nature methods‎
  • 2014‎

We present an open-source algorithm, Scalpel (http://scalpel.sourceforge.net/), which combines mapping and assembly for sensitive and specific discovery of insertions and deletions (indels) in exome-capture data. A detailed repeat analysis coupled with a self-tuning k-mer strategy allows Scalpel to outperform other state-of-the-art approaches for indel discovery, particularly in regions containing near-perfect repeats. We analyzed 593 families from the Simons Simplex Collection and demonstrated Scalpel's power to detect long (≥30 bp) transmitted events and enrichment for de novo likely gene-disrupting indels in autistic children.


Truncating Variants in NAA15 Are Associated with Variable Levels of Intellectual Disability, Autism Spectrum Disorder, and Congenital Anomalies.

  • Hanyin Cheng‎ et al.
  • American journal of human genetics‎
  • 2018‎

N-alpha-acetylation is a common co-translational protein modification that is essential for normal cell function in humans. We previously identified the genetic basis of an X-linked infantile lethal Mendelian disorder involving a c.109T>C (p.Ser37Pro) missense variant in NAA10, which encodes the catalytic subunit of the N-terminal acetyltransferase A (NatA) complex. The auxiliary subunit of the NatA complex, NAA15, is the dimeric binding partner for NAA10. Through a genotype-first approach with whole-exome or genome sequencing (WES/WGS) and targeted sequencing analysis, we identified and phenotypically characterized 38 individuals from 33 unrelated families with 25 different de novo or inherited, dominantly acting likely gene disrupting (LGD) variants in NAA15. Clinical features of affected individuals with LGD variants in NAA15 include variable levels of intellectual disability, delayed speech and motor milestones, and autism spectrum disorder. Additionally, mild craniofacial dysmorphology, congenital cardiac anomalies, and seizures are present in some subjects. RNA analysis in cell lines from two individuals showed degradation of the transcripts with LGD variants, probably as a result of nonsense-mediated decay. Functional assays in yeast confirmed a deleterious effect for two of the LGD variants in NAA15. Further supporting a mechanism of haploinsufficiency, individuals with copy-number variant (CNV) deletions involving NAA15 and surrounding genes can present with mild intellectual disability, mild dysmorphic features, motor delays, and decreased growth. We propose that defects in NatA-mediated N-terminal acetylation (NTA) lead to variable levels of neurodevelopmental disorders in humans, supporting the importance of the NatA complex in normal human development.


SNVer: a statistical tool for variant calling in analysis of pooled or individual next-generation sequencing data.

  • Zhi Wei‎ et al.
  • Nucleic acids research‎
  • 2011‎

We develop a statistical tool SNVer for calling common and rare variants in analysis of pooled or individual next-generation sequencing (NGS) data. We formulate variant calling as a hypothesis testing problem and employ a binomial-binomial model to test the significance of observed allele frequency against sequencing error. SNVer reports one single overall P-value for evaluating the significance of a candidate locus being a variant based on which multiplicity control can be obtained. This is particularly desirable because tens of thousands loci are simultaneously examined in typical NGS experiments. Each user can choose the false-positive error rate threshold he or she considers appropriate, instead of just the dichotomous decisions of whether to 'accept or reject the candidates' provided by most existing methods. We use both simulated data and real data to demonstrate the superior performance of our program in comparison with existing methods. SNVer runs very fast and can complete testing 300 K loci within an hour. This excellent scalability makes it feasible for analysis of whole-exome sequencing data, or even whole-genome sequencing data using high performance computing cluster. SNVer is freely available at http://snver.sourceforge.net/.


TAF1 Variants Are Associated with Dysmorphic Features, Intellectual Disability, and Neurological Manifestations.

  • Jason A O'Rawe‎ et al.
  • American journal of human genetics‎
  • 2015‎

We describe an X-linked genetic syndrome associated with mutations in TAF1 and manifesting with global developmental delay, intellectual disability (ID), characteristic facial dysmorphology, generalized hypotonia, and variable neurologic features, all in male individuals. Simultaneous studies using diverse strategies led to the identification of nine families with overlapping clinical presentations and affected by de novo or maternally inherited single-nucleotide changes. Two additional families harboring large duplications involving TAF1 were also found to share phenotypic overlap with the probands harboring single-nucleotide changes, but they also demonstrated a severe neurodegeneration phenotype. Functional analysis with RNA-seq for one of the families suggested that the phenotype is associated with downregulation of a set of genes notably enriched with genes regulated by E-box proteins. In addition, knockdown and mutant studies of this gene in zebrafish have shown a quantifiable, albeit small, effect on a neuronal phenotype. Our results suggest that mutations in TAF1 play a critical role in the development of this X-linked ID syndrome.


Reducing INDEL calling errors in whole genome and exome sequencing data.

  • Han Fang‎ et al.
  • Genome medicine‎
  • 2014‎

INDELs, especially those disrupting protein-coding regions of the genome, have been strongly associated with human diseases. However, there are still many errors with INDEL variant calling, driven by library preparation, sequencing biases, and algorithm artifacts.


Synaptic processes and immune-related pathways implicated in Tourette syndrome.

  • Fotis Tsetsos‎ et al.
  • Translational psychiatry‎
  • 2021‎

Tourette syndrome (TS) is a neuropsychiatric disorder of complex genetic architecture involving multiple interacting genes. Here, we sought to elucidate the pathways that underlie the neurobiology of the disorder through genome-wide analysis. We analyzed genome-wide genotypic data of 3581 individuals with TS and 7682 ancestry-matched controls and investigated associations of TS with sets of genes that are expressed in particular cell types and operate in specific neuronal and glial functions. We employed a self-contained, set-based association method (SBA) as well as a competitive gene set method (MAGMA) using individual-level genotype data to perform a comprehensive investigation of the biological background of TS. Our SBA analysis identified three significant gene sets after Bonferroni correction, implicating ligand-gated ion channel signaling, lymphocytic, and cell adhesion and transsynaptic signaling processes. MAGMA analysis further supported the involvement of the cell adhesion and trans-synaptic signaling gene set. The lymphocytic gene set was driven by variants in FLT3, raising an intriguing hypothesis for the involvement of a neuroinflammatory element in TS pathogenesis. The indications of involvement of ligand-gated ion channel signaling reinforce the role of GABA in TS, while the association of cell adhesion and trans-synaptic signaling gene set provides additional support for the role of adhesion molecules in neuropsychiatric disorders. This study reinforces previous findings but also provides new insights into the neurobiology of TS.


Is Persistent Motor or Vocal Tic Disorder a Milder Form of Tourette Syndrome?

  • Karla Claudio-Campos‎ et al.
  • Movement disorders : official journal of the Movement Disorder Society‎
  • 2021‎

Persistent motor or vocal tic disorder (PMVT) has been hypothesized to be a forme fruste of Tourette syndrome (TS). Although the primary diagnostic criterion for PMVT (presence of motor or vocal tics, but not both) is clear, less is known about its clinical presentation.


GestaltMatcher Database - a FAIR database for medical imaging data of rare disorders.

  • Hellen Lesmann‎ et al.
  • medRxiv : the preprint server for health sciences‎
  • 2023‎

The value of computer-assisted image analysis has been shown in several studies. The performance of tools with artificial intelligence (AI), such as GestaltMatcher, is improved with the size and diversity of the training set, but properly labeled training data is currently the biggest bottleneck in developing next-generation phenotyping (NGP) applications. Therefore, we developed GestaltMatcher Database (GMDB) - a database for machine-readable medical image data that complies with the FAIR principles and improves the openness and accessibility of scientific findings in Medical Genetics. An entry in GMDB consists of a medical image such as a portrait, X-ray, or fundoscopy, and machine-readable meta information such as a clinical feature encoded in HPO terminology or a disease-causing mutation reported in HGVS format. In the beginning, data was mainly collected by curators gathering images from the literature. Currently, clinicians and individuals recruited from patient support groups provide their previously unpublished data. For this patient-centered approach, we developed a digital consent form. GMDB is a modern publication medium for case reports that complements preprints, e.g., on medRxiv. To enable inter-cohort comparisons, we implemented a research feature in GMDB that computes the pairwise syndromic similarity between hand-picked cases. Through a community-driven effort, we compiled an image collection of over 7,533 cases with 792 disorders in GMDB. Most of the data was collected from 2,058 publications. In addition, about 1,018 frontal images of 498 previously unpublished cases were obtained. The web interface enables gene- and phenotype-centered queries or infinite scrolls in the gallery. Digital consent has led to increasing adoption of the approach by patients. The research app within GMDB was used to generate syndromic similarity matrices to characterize two novel phenotypes (CSNK2B, PSMC3). GMDB is the first FAIR database for NGP, where data are findable, accessible, interoperable, and reusable. It is a repository for medical images that cannot be included in medRxiv. That means GMDB connects clinicians with a shared interest in particular phenotypes and improves the performance of AI.


Biochemical and cellular analysis of Ogden syndrome reveals downstream Nt-acetylation defects.

  • Line M Myklebust‎ et al.
  • Human molecular genetics‎
  • 2015‎

The X-linked lethal Ogden syndrome was the first reported human genetic disorder associated with a mutation in an N-terminal acetyltransferase (NAT) gene. The affected males harbor an Ser37Pro (S37P) mutation in the gene encoding Naa10, the catalytic subunit of NatA, the major human NAT involved in the co-translational acetylation of proteins. Structural models and molecular dynamics simulations of the human NatA and its S37P mutant highlight differences in regions involved in catalysis and at the interface between Naa10 and the auxiliary subunit hNaa15. Biochemical data further demonstrate a reduced catalytic capacity and an impaired interaction between hNaa10 S37P and Naa15 as well as Naa50 (NatE), another interactor of the NatA complex. N-Terminal acetylome analyses revealed a decreased acetylation of a subset of NatA and NatE substrates in Ogden syndrome cells, supporting the genetic findings and our hypothesis regarding reduced Nt-acetylation of a subset of NatA/NatE-type substrates as one etiology for Ogden syndrome. Furthermore, Ogden syndrome fibroblasts display abnormal cell migration and proliferation capacity, possibly linked to a perturbed retinoblastoma pathway. N-Terminal acetylation clearly plays a role in Ogden syndrome, thus revealing the in vivo importance of N-terminal acetylation in human physiology and disease.


The Human Phenotype Ontology in 2017.

  • Sebastian Köhler‎ et al.
  • Nucleic acids research‎
  • 2017‎

Deep phenotyping has been defined as the precise and comprehensive analysis of phenotypic abnormalities in which the individual components of the phenotype are observed and described. The three components of the Human Phenotype Ontology (HPO; www.human-phenotype-ontology.org) project are the phenotype vocabulary, disease-phenotype annotations and the algorithms that operate on these. These components are being used for computational deep phenotyping and precision medicine as well as integration of clinical data into translational research. The HPO is being increasingly adopted as a standard for phenotypic abnormalities by diverse groups such as international rare disease organizations, registries, clinical labs, biomedical resources, and clinical software tools and will thereby contribute toward nascent efforts at global data exchange for identifying disease etiologies. This update article reviews the progress of the HPO project since the debut Nucleic Acids Research database article in 2014, including specific areas of expansion such as common (complex) disease, new algorithms for phenotype driven genomic discovery and diagnostics, integration of cross-species mapping efforts with the Mammalian Phenotype Ontology, an improved quality control pipeline, and the addition of patient-friendly terminology.


  1. SciCrunch.org Resources

    Welcome to the FDI Lab - SciCrunch.org Resources search. From here you can search through a compilation of resources used by FDI Lab - SciCrunch.org and see how data is organized within our community.

  2. Navigation

    You are currently on the Community Resources tab looking through categories and sources that FDI Lab - SciCrunch.org has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.

  3. Logging in and Registering

    If you have an account on FDI Lab - SciCrunch.org then you can log in from here to get additional features in FDI Lab - SciCrunch.org such as Collections, Saved Searches, and managing Resources.

  4. Searching

    Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:

    1. Use quotes around phrases you want to match exactly
    2. You can manually AND and OR terms to change how we search between words
    3. You can add "-" to terms to make sure no results return with that term in them (ex. Cerebellum -CA1)
    4. You can add "+" to terms to require they be in the data
    5. Using autocomplete specifies which branch of our semantics you with to search and can help refine your search
  5. Save Your Search

    You can save any searches you perform for quick access to later from here.

  6. Query Expansion

    We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.

  7. Collections

    If you are logged into FDI Lab - SciCrunch.org you can add data records to your collections to create custom spreadsheets across multiple sources of data.

  8. Facets

    Here are the facets that you can filter your papers by.

  9. Options

    From here we'll present any options for the literature, such as exporting your current results.

  10. Further Questions

    If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.

Publications Per Year

X

Year:

Count: