Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

This service exclusively searches for literature that cites resources. Please be aware that the total number of searchable documents is limited to those containing RRIDs and does not include all open-access literature.

Search

Type in a keyword to search

On page 1 showing 1 ~ 20 papers out of 37 papers

Relating hepatocellular carcinoma tumor samples and cell lines using gene expression data in translational research.

  • Bin Chen‎ et al.
  • BMC medical genomics‎
  • 2015‎

Cancer cell lines are used extensively to study cancer biology and to test hypotheses in translational research. The relevance of cell lines is dependent on how closely they resemble the tumors being studied. Relating tumors and cell lines, and recognizing their similarities and differences are thus very important for translational research. Rapid advances in genomics have led to the generation of large volumes of genomic and transcriptomic data for a diverse set of primary cancer samples, normal tissue samples and cancer cell lines. Hepatocellular Carcinoma (HCC) is one of the most common tumors worldwide, with high occurrence in Asia and sub-Saharan regions. The current effective treatments of HCC remain limited. In this work, we compared the gene expression measurements of 200 HCC tumor samples from The Cancer Genome Atlas and over 1000 cancer cell lines including 25 HCC cancer cell lines from Cancer Cell Line Encyclopedia. We showed that the HCC tumor samples correlate closely with HCC cell lines in comparison to cell lines derived from other tumor types. We further demonstrated that the most commonly used HCC cell lines resemble HCC tumors, while we identified nearly half of the cell lines that do not resemble primary tumors. Interestingly, a substantial number of genes that are critical for disease development or drug response are either expressed at low levels or absent among highly correlated cell lines; additional attention should be paid to these genes in translational research. Our study will be used to guide the selection of HCC cell lines and pinpoint the specific genes that are differentially expressed in either tumors or cell lines.


Automatic Labeling of Special Diagnostic Mammography Views from Images and DICOM Headers.

  • Dmytro S Lituiev‎ et al.
  • Journal of digital imaging‎
  • 2019‎

Applying state-of-the-art machine learning techniques to medical images requires a thorough selection and normalization of input data. One of such steps in digital mammography screening for breast cancer is the labeling and removal of special diagnostic views, in which diagnostic tools or magnification are applied to assist in assessment of suspicious initial findings. As a common task in medical informatics is prediction of disease and its stage, these special diagnostic views, which are only enriched among the cohort of diseased cases, will bias machine learning disease predictions. In order to automate this process, here, we develop a machine learning pipeline that utilizes both DICOM headers and images to predict such views in an automatic manner, allowing for their removal and the generation of unbiased datasets. We achieve AUC of 99.72% in predicting special mammogram views when combining both types of models. Finally, we apply these models to clean up a dataset of about 772,000 images with expected sensitivity of 99.0%. The pipeline presented in this paper can be applied to other datasets to obtain high-quality image sets suitable to train algorithms for disease detection.


Precision annotation of digital samples in NCBI's gene expression omnibus.

  • Dexter Hadley‎ et al.
  • Scientific data‎
  • 2017‎

The Gene Expression Omnibus (GEO) contains more than two million digital samples from functional genomics experiments amassed over almost two decades. However, individual sample meta-data remains poorly described by unstructured free text attributes preventing its largescale reanalysis. We introduce the Search Tag Analyze Resource for GEO as a web application (http://STARGEO.org) to curate better annotations of sample phenotypes uniformly across different studies, and to use these sample annotations to define robust genomic signatures of disease pathology by meta-analysis. In this paper, we target a small group of biomedical graduate students to show rapid crowd-curation of precise sample annotations across all phenotypes, and we demonstrate the biological validity of these crowd-curated annotations for breast cancer. STARGEO.org makes GEO data findable, accessible, interoperable and reusable (i.e., FAIR) to ultimately facilitate knowledge discovery. Our work demonstrates the utility of crowd-curation and interpretation of open 'big data' under FAIR principles as a first step towards realizing an ideal paradigm of precision medicine.


Systematic integration of biomedical knowledge prioritizes drugs for repurposing.

  • Daniel Scott Himmelstein‎ et al.
  • eLife‎
  • 2017‎

The ability to computationally predict whether a compound treats a disease would improve the economy and success rate of drug approval. This study describes Project Rephetio to systematically model drug efficacy based on 755 existing treatments. First, we constructed Hetionet (neo4j.het.io), an integrative network encoding knowledge from millions of biomedical studies. Hetionet v1.0 consists of 47,031 nodes of 11 types and 2,250,197 relationships of 24 types. Data were integrated from 29 public resources to connect compounds, diseases, genes, anatomies, pathways, biological processes, molecular functions, cellular components, pharmacologic classes, side effects, and symptoms. Next, we identified network patterns that distinguish treatments from non-treatments. Then, we predicted the probability of treatment for 209,168 compound-disease pairs (het.io/repurpose). Our predictions validated on two external sets of treatment and provided pharmacological insights on epilepsy, suggesting they will help prioritize drug repurposing candidates. This study was entirely open and received realtime feedback from 40 community members.


Multiple RNA structures affect translation initiation and UGA redefinition efficiency during synthesis of selenoprotein P.

  • Marco Mariotti‎ et al.
  • Nucleic acids research‎
  • 2017‎

Gene-specific expansion of the genetic code allows for UGA codons to specify the amino acid selenocysteine (Sec). A striking example of UGA redefinition occurs during translation of the mRNA coding for the selenium transport protein, selenoprotein P (SELENOP), which in vertebrates may contain up to 22 in-frame UGA codons. Sec incorporation at the first and downstream UGA codons occurs with variable efficiencies to control synthesis of full-length and truncated SELENOP isoforms. To address how the Selenop mRNA can direct dynamic codon redefinition in different regions of the same mRNA, we undertook a comprehensive search for phylogenetically conserved RNA structures and examined the function of these structures using cell-based assays, in vitro translation systems, and in vivo ribosome profiling of liver tissue from mice carrying genomic deletions of 3' UTR selenocysteine-insertion-sequences (SECIS1 and SECIS2). The data support a novel RNA structure near the start codon that impacts translation initiation, structures located adjacent to UGA codons, additional coding sequence regions necessary for efficient production of full-length SELENOP, and distinct roles for SECIS1 and SECIS2 at UGA codons. Our results uncover a remarkable diversity of RNA elements conducting multiple occurrences of UGA redefinition to control the synthesis of full-length and truncated SELENOP isoforms.


Copy number variation meta-analysis reveals a novel duplication at 9p24 associated with multiple neurodevelopmental disorders.

  • Joseph T Glessner‎ et al.
  • Genome medicine‎
  • 2017‎

Neurodevelopmental and neuropsychiatric disorders represent a wide spectrum of heterogeneous yet inter-related disease conditions. The overlapping clinical presentations of these diseases suggest a shared genetic etiology. We aim to identify shared structural variants spanning the spectrum of five neuropsychiatric disorders.


Leaf cell-specific and single-cell transcriptional profiling reveals a role for the palisade layer in UV light protection.

  • Carl Procko‎ et al.
  • The Plant cell‎
  • 2022‎

Like other complex multicellular organisms, plants are composed of different cell types with specialized shapes and functions. For example, most laminar leaves consist of multiple photosynthetic cell types. These cell types include the palisade mesophyll, which typically forms one or more cell layers on the adaxial side of the leaf. Despite their importance for photosynthesis, we know little about how palisade cells differ at the molecular level from other photosynthetic cell types. To this end, we have used a combination of cell-specific profiling using fluorescence-activated cell sorting and single-cell RNA-sequencing methods to generate a transcriptional blueprint of the palisade mesophyll in Arabidopsis thaliana leaves. We find that despite their unique morphology, palisade cells are otherwise transcriptionally similar to other photosynthetic cell types. Nevertheless, we show that some genes in the phenylpropanoid biosynthesis pathway have both palisade-enriched expression and are light-regulated. Phenylpropanoid gene activity in the palisade was required for production of the ultraviolet (UV)-B protectant sinapoylmalate, which may protect the palisade and/or other leaf cells against damaging UV light. These findings improve our understanding of how different photosynthetic cell types in the leaf can function uniquely to optimize leaf performance, despite their transcriptional similarities.


Tracing diagnosis trajectories over millions of patients reveal an unexpected risk in schizophrenia.

  • Hyojung Paik‎ et al.
  • Scientific data‎
  • 2019‎

The identification of novel disease associations using big-data for patient care has had limited success. In this study, we created a longitudinal disease network of traced readmissions (disease trajectories), merging data from over 10.4 million inpatients through the Healthcare Cost and Utilization Project, which allowed the representation of disease progression mapping over 300 diseases. From these disease trajectories, we discovered an interesting association between schizophrenia and rhabdomyolysis, a rare muscle disease (incidence < 1E-04) (relative risk, 2.21 [1.80-2.71, confidence interval = 0.95], P-value 9.54E-15). We validated this association by using independent electronic medical records from over 830,000 patients at the University of California, San Francisco (UCSF) medical center. A case review of 29 rhabdomyolysis incidents in schizophrenia patients at UCSF demonstrated that 62% are idiopathic, without the use of any drug known to lead to this adverse event, suggesting a warning to physicians to watch for this unexpected risk of schizophrenia. Large-scale analysis of disease trajectories can help physicians understand potential sequential events in their patients.


The impact of the metabotropic glutamate receptor and other gene family interaction networks on autism.

  • Dexter Hadley‎ et al.
  • Nature communications‎
  • 2014‎

Although multiple reports show that defective genetic networks underlie the aetiology of autism, few have translated into pharmacotherapeutic opportunities. Since drugs compete with endogenous small molecules for protein binding, many successful drugs target large gene families with multiple drug binding sites. Here we search for defective gene family interaction networks (GFINs) in 6,742 patients with the ASDs relative to 12,544 neurologically normal controls, to find potentially druggable genetic targets. We find significant enrichment of structural defects (P ≤ 2.40E-09, 1.8-fold enrichment) in the metabotropic glutamate receptor (GRM) GFIN, previously observed to impact attention deficit hyperactivity disorder (ADHD) and schizophrenia. Also, the MXD-MYC-MAX network of genes, previously implicated in cancer, is significantly enriched (P ≤ 3.83E-23, 2.5-fold enrichment), as is the calmodulin 1 (CALM1) gene interaction network (P ≤ 4.16E-04, 14.4-fold enrichment), which regulates voltage-independent calcium-activated action potentials at the neuronal synapse. We find that multiple defective gene family interactions underlie autism, presenting new translational opportunities to explore for therapeutic interventions.


A candidate gene approach identifies the CHRNA5-A3-B4 region as a risk factor for age-dependent nicotine addiction.

  • Robert B Weiss‎ et al.
  • PLoS genetics‎
  • 2008‎

People who begin daily smoking at an early age are at greater risk of long-term nicotine addiction. We tested the hypothesis that associations between nicotinic acetylcholine receptor (nAChR) genetic variants and nicotine dependence assessed in adulthood will be stronger among smokers who began daily nicotine exposure during adolescence. We compared nicotine addiction-measured by the Fagerstrom Test of Nicotine Dependence-in three cohorts of long-term smokers recruited in Utah, Wisconsin, and by the NHLBI Lung Health Study, using a candidate-gene approach with the neuronal nAChR subunit genes. This SNP panel included common coding variants and haplotypes detected in eight alpha and three beta nAChR subunit genes found in European American populations. In the 2,827 long-term smokers examined, common susceptibility and protective haplotypes at the CHRNA5-A3-B4 locus were associated with nicotine dependence severity (p = 2.0x10(-5); odds ratio = 1.82; 95% confidence interval 1.39-2.39) in subjects who began daily smoking at or before the age of 16, an exposure period that results in a more severe form of adult nicotine dependence. A substantial shift in susceptibility versus protective diplotype frequency (AA versus BC = 17%, AA versus CC = 27%) was observed in the group that began smoking by age 16. This genetic effect was not observed in subjects who began daily nicotine use after the age of 16. These results establish a strong mechanistic link among early nicotine exposure, common CHRNA5-A3-B4 haplotypes, and adult nicotine addiction in three independent populations of European origins. The identification of an age-dependent susceptibility haplotype reinforces the importance of preventing early exposure to tobacco through public health policies.


Patterns of sequence conservation in presynaptic neural genes.

  • Dexter Hadley‎ et al.
  • Genome biology‎
  • 2006‎

The neuronal synapse is a fundamental functional unit in the central nervous system of animals. Because synaptic function is evolutionarily conserved, we reasoned that functional sequences of genes and related genomic elements known to play important roles in neurotransmitter release would also be conserved.


Rare copy number variants in over 100,000 European ancestry subjects reveal multiple disease associations.

  • Yun Rose Li‎ et al.
  • Nature communications‎
  • 2020‎

Copy number variants (CNVs) are suggested to have a widespread impact on the human genome and phenotypes. To understand the role of CNVs across human diseases, we examine the CNV genomic landscape of 100,028 unrelated individuals of European ancestry, using SNP and CGH array datasets. We observe an average CNV burden of ~650 kb, identifying a total of 11,314 deletion, 5625 duplication, and 2746 homozygous deletion CNV regions (CNVRs). In all, 13.7% are unreported, 58.6% overlap with at least one gene, and 32.8% interrupt coding exons. These CNVRs are significantly more likely to overlap OMIM genes (2.94-fold), GWAS loci (1.52-fold), and non-coding RNAs (1.44-fold), compared with random distribution (P < 1 × 10-3). We uncover CNV associations with four major disease categories, including autoimmune, cardio-metabolic, oncologic, and neurological/psychiatric diseases, and identify several drug-repurposing opportunities. Our results demonstrate robust frequency definition for large-scale rare variant association studies, identify CNVs associated with major disease categories, and illustrate the pleiotropic impact of CNVs in human disease.


Demonstration of Protein-Based Human Identification Using the Hair Shaft Proteome.

  • Glendon J Parker‎ et al.
  • PloS one‎
  • 2016‎

Human identification from biological material is largely dependent on the ability to characterize genetic polymorphisms in DNA. Unfortunately, DNA can degrade in the environment, sometimes below the level at which it can be amplified by PCR. Protein however is chemically more robust than DNA and can persist for longer periods. Protein also contains genetic variation in the form of single amino acid polymorphisms. These can be used to infer the status of non-synonymous single nucleotide polymorphism alleles. To demonstrate this, we used mass spectrometry-based shotgun proteomics to characterize hair shaft proteins in 66 European-American subjects. A total of 596 single nucleotide polymorphism alleles were correctly imputed in 32 loci from 22 genes of subjects' DNA and directly validated using Sanger sequencing. Estimates of the probability of resulting individual non-synonymous single nucleotide polymorphism allelic profiles in the European population, using the product rule, resulted in a maximum power of discrimination of 1 in 12,500. Imputed non-synonymous single nucleotide polymorphism profiles from European-American subjects were considerably less frequent in the African population (maximum likelihood ratio = 11,000). The converse was true for hair shafts collected from an additional 10 subjects with African ancestry, where some profiles were more frequent in the African population. Genetically variant peptides were also identified in hair shaft datasets from six archaeological skeletal remains (up to 260 years old). This study demonstrates that quantifiable measures of identity discrimination and biogeographic background can be obtained from detecting genetically variant peptides in hair shaft protein, including hair from bioarchaeological contexts.


Stretch-activated ion channels identified in the touch-sensitive structures of carnivorous Droseraceae plants.

  • Carl Procko‎ et al.
  • eLife‎
  • 2021‎

In response to touch, some carnivorous plants such as the Venus flytrap have evolved spectacular movements to capture animals for nutrient acquisition. However, the molecules that confer this sensitivity remain unknown. We used comparative transcriptomics to show that expression of three genes encoding homologs of the MscS-Like (MSL) and OSCA/TMEM63 family of mechanosensitive ion channels are localized to touch-sensitive trigger hairs of Venus flytrap. We focus here on the candidate with the most enriched expression in trigger hairs, the MSL homolog FLYCATCHER1 (FLYC1). We show that FLYC1 transcripts are localized to mechanosensory cells within the trigger hair, transfecting FLYC1 induces chloride-permeable stretch-activated currents in naïve cells, and transcripts coding for FLYC1 homologs are expressed in touch-sensing cells of Cape sundew, a related carnivorous plant of the Droseraceae family. Our data suggest that the mechanism of prey recognition in carnivorous Droseraceae evolved by co-opting ancestral mechanosensitive ion channels to sense touch.


A genome-wide association study identifies only two ancestry specific variants associated with spontaneous preterm birth.

  • Nadav Rappoport‎ et al.
  • Scientific reports‎
  • 2018‎

Preterm birth (PTB), or the delivery prior to 37 weeks of gestation, is a significant cause of infant morbidity and mortality. Although twin studies estimate that maternal genetic contributions account for approximately 30% of the incidence of PTB, and other studies reported fetal gene polymorphism association, to date no consistent associations have been identified. In this study, we performed the largest reported genome-wide association study analysis on 1,349 cases of PTB and 12,595 ancestry-matched controls from the focusing on genomic fetal signals. We tested over 2 million single nucleotide polymorphisms (SNPs) for associations with PTB across five subpopulations: African (AFR), the Americas (AMR), European, South Asian, and East Asian. We identified only two intergenic loci associated with PTB at a genome-wide level of significance: rs17591250 (P = 4.55E-09) on chromosome 1 in the AFR population and rs1979081 (P = 3.72E-08) on chromosome 8 in the AMR group. We have queried several existing replication cohorts and found no support of these associations. We conclude that the fetal genetic contribution to PTB is unlikely due to single common genetic variant, but could be explained by interactions of multiple common variants, or of rare variants affected by environmental influences, all not detectable using a GWAS alone.


Meta-Analysis Reveals the Prognostic Relevance of Nuclear and Membrane-Associated Bile Acid Receptors in Gastric Cancer.

  • Michael Rohr‎ et al.
  • Clinical and translational gastroenterology‎
  • 2021‎

Bile acids (BAs) arising from duodenogastric reflux are known to facilitate gastric cancer (GC) development. Although BAs traditionally contribute to carcinogenesis through direct cellular cytotoxicity, increasing evidence implicates nuclear and membrane BA receptors (BARs) as additional factors influencing cancer risk. Indeed, some BARs are already linked with GC, but conflicting evidence and lack of information regarding other endogenous BARs warrant further investigation. In this study, we meta-analyzed multiple data sets to identify clinically relevant relationships between BAR expression and prognosis, clinicopathology, and activity in GC.


Transcriptome changes in stages of non-alcoholic fatty liver disease.

  • Jihad Aljabban‎ et al.
  • World journal of hepatology‎
  • 2022‎

Non-alcoholic fatty liver disease (NAFLD) is the most common chronic liver disease in the United States and globally. The currently understood model of pathogenesis consists of a 'multiple hit' hypothesis in which environmental and genetic factors contribute to hepatic inflammation and injury.


De novo structural mutation rates and gamete-of-origin biases revealed through genome sequencing of 2,396 families.

  • Jonathan R Belyeu‎ et al.
  • American journal of human genetics‎
  • 2021‎

Each human genome includes de novo mutations that arose during gametogenesis. While these germline mutations represent a fundamental source of new genetic diversity, they can also create deleterious alleles that impact fitness. Whereas the rate and patterns of point mutations in the human germline are now well understood, far less is known about the frequency and features that impact de novo structural variants (dnSVs). We report a family-based study of germline mutations among 9,599 human genomes from 33 multigenerational CEPH-Utah families and 2,384 families from the Simons Foundation Autism Research Initiative. We find that de novo structural mutations detected by alignment-based, short-read WGS occur at an overall rate of at least 0.160 events per genome in unaffected individuals, and we observe a significantly higher rate (0.206 per genome) in ASD-affected individuals. In both probands and unaffected samples, nearly 73% of de novo structural mutations arose in paternal gametes, and we predict most de novo structural mutations to be caused by mutational mechanisms that do not require sequence homology. After multiple testing correction, we did not observe a statistically significant correlation between parental age and the rate of de novo structural variation in offspring. These results highlight that a spectrum of mutational mechanisms contribute to germline structural mutations and that these mechanisms most likely have markedly different rates and selective pressures than those leading to point mutations.


Cis-eQTL-based trans-ethnic meta-analysis reveals novel genes associated with breast cancer risk.

  • Joshua D Hoffman‎ et al.
  • PLoS genetics‎
  • 2017‎

Breast cancer is the most common solid organ malignancy and the most frequent cause of cancer death among women worldwide. Previous research has yielded insights into its genetic etiology, but there remains a gap in the understanding of genetic factors that contribute to risk, and particularly in the biological mechanisms by which genetic variation modulates risk. The National Cancer Institute's "Up for a Challenge" (U4C) competition provided an opportunity to further elucidate the genetic basis of the disease. Our group leveraged the seven datasets made available by the U4C organizers and data from the publicly available UK Biobank cohort to examine associations between imputed gene expression and breast cancer risk. In particular, we used reference datasets describing the breast tissue and whole blood transcriptomes to impute expression levels in breast cancer cases and controls. In trans-ethnic meta-analyses of U4C and UK Biobank data, we found significant associations between breast cancer risk and the expression of RCCD1 (joint p-value: 3.6x10-06) and DHODH (p-value: 7.1x10-06) in breast tissue, as well as a suggestive association for ANKLE1 (p-value: 9.3x10-05). Expression of RCCD1 in whole blood was also suggestively associated with disease risk (p-value: 1.2x10-05), as were expression of ACAP1 (p-value: 1.9x10-05) and LRRC25 (p-value: 5.2x10-05). While genome-wide association studies (GWAS) have implicated RCCD1 and ANKLE1 in breast cancer risk, they have not identified the remaining three genes. Among the genetic variants that contributed to the predicted expression of the five genes, we found 23 nominally (p-value < 0.05) associated with breast cancer risk, among which 15 are not in high linkage disequilibrium with risk variants previously identified by GWAS. In summary, we used a transcriptome-based approach to investigate the genetic underpinnings of breast carcinogenesis. This approach provided an avenue for deciphering the functional relevance of genes and genetic variants involved in breast cancer.


Systematic data-querying of large pediatric biorepository identifies novel Ehlers-Danlos Syndrome variant.

  • Akshatha Desai‎ et al.
  • BMC musculoskeletal disorders‎
  • 2016‎

Ehlers Danlos Syndrome is a rare form of inherited connective tissue disorder, which primarily affects skin, joints, muscle, and blood cells. The current study aimed at finding the mutation that causing EDS type VII C also known as "Dermatosparaxis" in this family.


  1. SciCrunch.org Resources

    Welcome to the FDI Lab - SciCrunch.org Resources search. From here you can search through a compilation of resources used by FDI Lab - SciCrunch.org and see how data is organized within our community.

  2. Navigation

    You are currently on the Community Resources tab looking through categories and sources that FDI Lab - SciCrunch.org has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.

  3. Logging in and Registering

    If you have an account on FDI Lab - SciCrunch.org then you can log in from here to get additional features in FDI Lab - SciCrunch.org such as Collections, Saved Searches, and managing Resources.

  4. Searching

    Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:

    1. Use quotes around phrases you want to match exactly
    2. You can manually AND and OR terms to change how we search between words
    3. You can add "-" to terms to make sure no results return with that term in them (ex. Cerebellum -CA1)
    4. You can add "+" to terms to require they be in the data
    5. Using autocomplete specifies which branch of our semantics you with to search and can help refine your search
  5. Save Your Search

    You can save any searches you perform for quick access to later from here.

  6. Query Expansion

    We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.

  7. Collections

    If you are logged into FDI Lab - SciCrunch.org you can add data records to your collections to create custom spreadsheets across multiple sources of data.

  8. Facets

    Here are the facets that you can filter your papers by.

  9. Options

    From here we'll present any options for the literature, such as exporting your current results.

  10. Further Questions

    If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.

Publications Per Year

X

Year:

Count: