FDI Lab - SciCrunch.org | Searching in Literature

New implications on genomic adaptation derived from the Helicobacter pylori genome comparison.

Edgar Eduardo Lara-Ramírez‎ et al.
PloS one‎
2011‎

Helicobacter pylori has a reduced genome and lives in a tough environment for long-term persistence. It evolved with its particular characteristics for biological adaptation. Because several H. pylori genome sequences are available, comparative analysis could help to better understand genomic adaptation of this particular bacterium.

A large-scale crop protection bioassay data set.

Anna Gaulton‎ et al.
Scientific data‎
2015‎

ChEMBL is a large-scale drug discovery database containing bioactivity information primarily extracted from scientific literature. Due to the medicinal chemistry focus of the journals from which data are extracted, the data are currently of most direct value in the field of human health research. However, many of the scientific use-cases for the current data set are equally applicable in other fields, such as crop protection research: for example, identification of chemical scaffolds active against a particular target or endpoint, the de-convolution of the potential targets of a phenotypic assay, or the potential targets/pathways for safety liabilities. In order to broaden the applicability of the ChEMBL database and allow more widespread use in crop protection research, an extensive data set of bioactivity data of insecticidal, fungicidal and herbicidal compounds and assays was collated and added to the database.

Large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery.

Nicolas Bosc‎ et al.
Journal of cheminformatics‎
2019‎

Structure-activity relationship modelling is frequently used in the early stage of drug discovery to assess the activity of a compound on one or several targets, and can also be used to assess the interaction of compounds with liability targets. QSAR models have been used for these and related applications over many years, with good success. Conformal prediction is a relatively new QSAR approach that provides information on the certainty of a prediction, and so helps in decision-making. However, it is not always clear how best to make use of this additional information. In this article, we describe a case study that directly compares conformal prediction with traditional QSAR methods for large-scale predictions of target-ligand binding. The ChEMBL database was used to extract a data set comprising data from 550 human protein targets with different bioactivity profiles. For each target, a QSAR model and a conformal predictor were trained and their results compared. The models were then evaluated on new data published since the original models were built to simulate a "real world" application. The comparative study highlights the similarities between the two techniques but also some differences that it is important to bear in mind when the methods are used in practical drug discovery applications.

The ChEMBL bioactivity database: an update.

A Patrícia Bento‎ et al.
Nucleic acids research‎
2014‎

ChEMBL is an open large-scale bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012 Nucleic Acids Research Database Issue. Since then, a variety of new data sources and improvements in functionality have contributed to the growth and utility of the resource. In particular, more comprehensive tracking of compounds from research stages through clinical development to market is provided through the inclusion of data from United States Adopted Name applications; a new richer data model for representing drug targets has been developed; and a number of methods have been put in place to allow users to more easily identify reliable data. Finally, access to ChEMBL is now available via a new Resource Description Framework format, in addition to the web-based interface, data downloads and web services.

Pharos: Collating protein information to shed light on the druggable genome.

Dac-Trung Nguyen‎ et al.
Nucleic acids research‎
2017‎

The 'druggable genome' encompasses several protein families, but only a subset of targets within them have attracted significant research attention and thus have information about them publicly available. The Illuminating the Druggable Genome (IDG) program was initiated in 2014, has the goal of developing experimental techniques and a Knowledge Management Center (KMC) that would collect and organize information about protein targets from four families, representing the most common druggable targets with an emphasis on understudied proteins. Here, we describe two resources developed by the KMC: the Target Central Resource Database (TCRD) which collates many heterogeneous gene/protein datasets and Pharos (https://pharos.nih.gov), a multimodal web interface that presents the data from TCRD. We briefly describe the types and sources of data considered by the KMC and then highlight features of the Pharos interface designed to enable intuitive access to the IDG knowledgebase. The aim of Pharos is to encourage 'serendipitous browsing', whereby related, relevant information is made easily discoverable. We conclude by describing two use cases that highlight the utility of Pharos and TCRD.

UniChem: a unified chemical structure cross-referencing and identifier tracking system.

Jon Chambers‎ et al.
Journal of cheminformatics‎
2013‎

UniChem is a freely available compound identifier mapping service on the internet, designed to optimize the efficiency with which structure-based hyperlinks may be built and maintained between chemistry-based resources. In the past, the creation and maintenance of such links at EMBL-EBI, where several chemistry-based resources exist, has required independent efforts by each of the separate teams. These efforts were complicated by the different data models, release schedules, and differing business rules for compound normalization and identifier nomenclature that exist across the organization. UniChem, a large-scale, non-redundant database of Standard InChIs with pointers between these structures and chemical identifiers from all the separate chemistry resources, was developed as a means of efficiently sharing the maintenance overhead of creating these links. Thus, for each source represented in UniChem, all links to and from all other sources are automatically calculated and immediately available for all to use. Updated mappings are immediately available upon loading of new data releases from the sources. Web services in UniChem provide users with a single simple automatable mechanism for maintaining all links from their resource to all other sources represented in UniChem. In addition, functionality to track changes in identifier usage allows users to monitor which identifiers are current, and which are obsolete. Lastly, UniChem has been deliberately designed to allow additional resources to be included with minimal effort. Indeed, the recent inclusion of data sources external to EMBL-EBI has provided a simple means of providing users with an even wider selection of resources with which to link to, all at no extra cost, while at the same time providing a simple mechanism for external resources to link to all EMBL-EBI chemistry resources.

Improving the odds of drug development success through human genomics: modelling study.

Aroon D Hingorani‎ et al.
Scientific reports‎
2019‎

Lack of efficacy in the intended disease indication is the major cause of clinical phase drug development failure. Explanations could include the poor external validity of pre-clinical (cell, tissue, and animal) models of human disease and the high false discovery rate (FDR) in preclinical science. FDR is related to the proportion of true relationships available for discovery (γ), and the type 1 (false-positive) and type 2 (false negative) error rates of the experiments designed to uncover them. We estimated the FDR in preclinical science, its effect on drug development success rates, and improvements expected from use of human genomics rather than preclinical studies as the primary source of evidence for drug target identification. Calculations were based on a sample space defined by all human diseases - the 'disease-ome' - represented as columns; and all protein coding genes - 'the protein-coding genome'- represented as rows, producing a matrix of unique gene- (or protein-) disease pairings. We parameterised the space based on 10,000 diseases, 20,000 protein-coding genes, 100 causal genes per disease and 4000 genes encoding druggable targets, examining the effect of varying the parameters and a range of underlying assumptions, on the inferences drawn. We estimated γ, defined mathematical relationships between preclinical FDR and drug development success rates, and estimated improvements in success rates based on human genomics (rather than orthodox preclinical studies). Around one in every 200 protein-disease pairings was estimated to be causal (γ = 0.005) giving an FDR in preclinical research of 92.6%, which likely makes a major contribution to the reported drug development failure rate of 96%. Observed success rate was only slightly greater than expected for a random pick from the sample space. Values for γ back-calculated from reported preclinical and clinical drug development success rates were also close to the a priori estimates. Substituting genome wide (or druggable genome wide) association studies for preclinical studies as the major information source for drug target identification was estimated to reverse the probability of late stage failure because of the more stringent type 1 error rate employed and the ability to interrogate every potential druggable target in the same experiment. Genetic studies conducted at much larger scale, with greater resolution of disease end-points, e.g. by connecting genomics and electronic health record data within healthcare systems has the potential to produce radical improvement in drug development success rate.

Actionable druggable genome-wide Mendelian randomization identifies repurposing opportunities for COVID-19.

Liam Gaziano‎ et al.
Nature medicine‎
2021‎

Drug repurposing provides a rapid approach to meet the urgent need for therapeutics to address COVID-19. To identify therapeutic targets relevant to COVID-19, we conducted Mendelian randomization analyses, deriving genetic instruments based on transcriptomic and proteomic data for 1,263 actionable proteins that are targeted by approved drugs or in clinical phase of drug development. Using summary statistics from the Host Genetics Initiative and the Million Veteran Program, we studied 7,554 patients hospitalized with COVID-19 and >1 million controls. We found significant Mendelian randomization results for three proteins (ACE2, P = 1.6 × 10-6; IFNAR2, P = 9.8 × 10-11 and IL-10RB, P = 2.3 × 10-14) using cis-expression quantitative trait loci genetic instruments that also had strong evidence for colocalization with COVID-19 hospitalization. To disentangle the shared expression quantitative trait loci signal for IL10RB and IFNAR2, we conducted phenome-wide association scans and pathway enrichment analysis, which suggested that IFNAR2 is more likely to play a role in COVID-19 hospitalization. Our findings prioritize trials of drugs targeting IFNAR2 and ACE2 for early management of COVID-19.

The ChEMBL Database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periods.

Barbara Zdrazil‎ et al.
Nucleic acids research‎
2024‎

ChEMBL (https://www.ebi.ac.uk/chembl/) is a manually curated, high-quality, large-scale, open, FAIR and Global Core Biodata Resource of bioactive molecules with drug-like properties, previously described in the 2012, 2014, 2017 and 2019 Nucleic Acids Research Database Issues. Since its introduction in 2009, ChEMBL's content has changed dramatically in size and diversity of data types. Through incorporation of multiple new datasets from depositors since the 2019 update, ChEMBL now contains slightly more bioactivity data from deposited data vs data extracted from literature. In collaboration with the EUbOPEN consortium, chemical probe data is now regularly deposited into ChEMBL. Release 27 made curated data available for compounds screened for potential anti-SARS-CoV-2 activity from several large-scale drug repurposing screens. In addition, new patent bioactivity data have been added to the latest ChEMBL releases, and various new features have been incorporated, including a Natural Product likeness score, updated flags for Natural Products, a new flag for Chemical Probes, and the initial annotation of the action type for ∼270 000 bioactivity measurements.

The EBI RDF platform: linked open data for the life sciences.

Simon Jupp‎ et al.
Bioinformatics (Oxford, England)‎
2014‎

Resource description framework (RDF) is an emerging technology for describing, publishing and linking life science data. As a major provider of bioinformatics data and services, the European Bioinformatics Institute (EBI) is committed to making data readily accessible to the community in ways that meet existing demand. The EBI RDF platform has been developed to meet an increasing demand to coordinate RDF activities across the institute and provides a new entry point to querying and exploring integrated resources available at the EBI.

A viral-human interactome based on structural motif-domain interactions captures the human infectome.

Aldo Segura-Cabrera‎ et al.
PloS one‎
2013‎

Protein interactions between a pathogen and its host are fundamental in the establishment of the pathogen and underline the infection mechanism. In the present work, we developed a single predictive model for building a host-viral interactome based on the identification of structural descriptors from motif-domain interactions of protein complexes deposited in the Protein Data Bank (PDB). The structural descriptors were used for searching, in a database of protein sequences of human and five clinically important viruses; therefore, viral and human proteins sharing a descriptor were predicted as interacting proteins. The analysis of the host-viral interactome allowed to identify a set of new interactions that further explain molecular mechanism associated with viral infections and showed that it was able to capture human proteins already associated to viral infections (human infectome) and non-infectious diseases (human diseasome). The analysis of human proteins targeted by viral proteins in the context of a human interactome showed that their neighbors are enriched in proteins reported with differential expression under infection and disease conditions. It is expected that the findings of this work will contribute to the development of systems biology for infectious diseases, and help guide the rational identification and prioritization of novel drug targets.

The Global Phosphorylation Landscape of SARS-CoV-2 Infection.

Mehdi Bouhaddou‎ et al.
Cell‎
2020‎

The causative agent of the coronavirus disease 2019 (COVID-19) pandemic, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has infected millions and killed hundreds of thousands of people worldwide, highlighting an urgent need to develop antiviral therapies. Here we present a quantitative mass spectrometry-based phosphoproteomics survey of SARS-CoV-2 infection in Vero E6 cells, revealing dramatic rewiring of phosphorylation on host and viral proteins. SARS-CoV-2 infection promoted casein kinase II (CK2) and p38 MAPK activation, production of diverse cytokines, and shutdown of mitotic kinases, resulting in cell cycle arrest. Infection also stimulated a marked induction of CK2-containing filopodial protrusions possessing budding viral particles. Eighty-seven drugs and compounds were identified by mapping global phosphorylation profiles to dysregulated kinases and pathways. We found pharmacologic inhibition of the p38, CK2, CDK, AXL, and PIKFYVE kinases to possess antiviral efficacy, representing potential COVID-19 therapies.

Drug mechanism-of-action discovery through the integration of pharmacological and CRISPR screens.

Emanuel Gonçalves‎ et al.
Molecular systems biology‎
2020‎

Low success rates during drug development are due, in part, to the difficulty of defining drug mechanism-of-action and molecular markers of therapeutic activity. Here, we integrated 199,219 drug sensitivity measurements for 397 unique anti-cancer drugs with genome-wide CRISPR loss-of-function screens in 484 cell lines to systematically investigate cellular drug mechanism-of-action. We observed an enrichment for positive associations between the profile of drug sensitivity and knockout of a drug's nominal target, and by leveraging protein-protein networks, we identified pathways underpinning drug sensitivity. This revealed an unappreciated positive association between mitochondrial E3 ubiquitin-protein ligase MARCH5 dependency and sensitivity to MCL1 inhibitors in breast cancer cell lines. We also estimated drug on-target and off-target activity, informing on specificity, potency and toxicity. Linking drug and gene dependency together with genomic data sets uncovered contexts in which molecular networks when perturbed mediate cancer cell loss-of-fitness and thereby provide independent and orthogonal evidence of biomarkers for drug development. This study illustrates how integrating cell line drug sensitivity with CRISPR loss-of-function screens can elucidate mechanism-of-action to advance drug development.

Open Targets: a platform for therapeutic target identification and validation.

Gautier Koscielny‎ et al.
Nucleic acids research‎
2017‎

We have designed and developed a data integration and visualization platform that provides evidence about the association of known and potential drug targets with diseases. The platform is designed to support identification and prioritization of biological targets for follow-up. Each drug target is linked to a disease using integrated genome-wide data from a broad range of data sources. The platform provides either a target-centric workflow to identify diseases that may be associated with a specific target, or a disease-centric workflow to identify targets that may be associated with a specific disease. Users can easily transition between these target- and disease-centric workflows. The Open Targets Validation Platform is accessible at https://www.targetvalidation.org.

Drug Safety Data Curation and Modeling in ChEMBL: Boxed Warnings and Withdrawn Drugs.

Fiona M I Hunter‎ et al.
Chemical research in toxicology‎
2021‎

The safety of marketed drugs is an ongoing concern, with some of the more frequently prescribed medicines resulting in serious or life-threatening adverse effects in some patients. Safety-related information for approved drugs has been curated to include the assignment of toxicity class(es) based on their withdrawn status and/or black box warning information described on medicinal product labels. The ChEMBL resource contains a wide range of bioactivity data types, from early "Discovery" stage preclinical data for individual compounds through to postclinical data on marketed drugs; the inclusion of the curated drug safety data set within this framework can support a wide range of safety-related drug discovery questions. The curated drug safety data set will be made freely available through ChEMBL and updated in future database releases.

Pranlukast Antagonizes CD49f and Reduces Stemness in Triple-Negative Breast Cancer Cells.

Inés Velázquez-Quesada‎ et al.
Drug design, development and therapy‎
2020‎

Cancer stem cells (CSCs) drive the initiation, maintenance, and therapy response of breast tumors. CD49f is expressed in breast CSCs and functions in the maintenance of stemness. Thus, blockade of CD49f is a potential therapeutic approach for targeting breast CSCs. In the present study, we aimed to repurpose drugs as CD49f antagonists.

A large-scale dataset of in vivo pharmacology assay results.

Fiona M I Hunter‎ et al.
Scientific data‎
2018‎

ChEMBL is a large-scale, open-access drug discovery resource containing bioactivity information primarily extracted from scientific literature. A substantial dataset of more than 135,000 in vivo assays has been collated as a key resource of animal models for translational medicine within drug discovery. To improve the utility of the in vivo data, an extensive data curation task has been undertaken that allows the assays to be grouped by animal disease model or phenotypic endpoint. The dataset contains previously unavailable information about compounds or drugs tested in animal models and, in conjunction with assay data on protein targets or cell- or tissue- based systems, allows the investigation of the effects of compounds at differing levels of biological complexity. Equally, it enables researchers to identify compounds that have been investigated for a group of disease-, pharmacology- or toxicity-relevant assays.

ChEMBL: towards direct deposition of bioassay data.

David Mendez‎ et al.
Nucleic acids research‎
2019‎

ChEMBL is a large, open-access bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012, 2014 and 2017 Nucleic Acids Research Database Issues. In the last two years, several important improvements have been made to the database and are described here. These include more robust capture and representation of assay details; a new data deposition system, allowing updating of data sets and deposition of supplementary data; and a completely redesigned web interface, with enhanced search and filtering capabilities.

Potential mechanism of action of meso-dihydroguaiaretic acid on Mycobacterium tuberculosis H37Rv.

Aldo F Clemente-Soto‎ et al.
Molecules (Basel, Switzerland)‎
2014‎

The isolation and characterization of the lignan meso-dihydroguaiaretic acid (MDGA) from Larrea tridentata and its activity against Mycobacterial tuberculosis has been demonstrated, but no information regarding its mechanism of action has been documented. Therefore, in this study we carry out the gene expression from total RNA obtained from M. tuberculosis H37Rv treated with MDGA using microarray technology, which was validated by quantitative real time polymerase chain reaction. Results showed that the alpha subunit of coenzyme A transferase of M. tuberculosis H37Rv is present in both geraniol and 1-and 2-methylnaphthalene degradation pathways, which are targeted by MDGA. This assumption was supported by molecular docking which showed stable interaction between MDGA with the active site of the enzyme. We propose that inhibition of coenzyme A transferase of M. tuberculosis H37Rv results in the accumulation of geraniol and 1-and 2-methylnaphtalene inside bacteria, causing membrane destabilization and death of the pathogen. The natural product MDGA is thus an attractive template to develop new anti-tuberculosis drugs, because its target is different from those of known anti-tubercular agents.

The ChEMBL database in 2017.

Anna Gaulton‎ et al.
Nucleic acids research‎
2017‎

ChEMBL is an open large-scale bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012 and 2014 Nucleic Acids Research Database Issues. Since then, alongside the continued extraction of data from the medicinal chemistry literature, new sources of bioactivity data have also been added to the database. These include: deposited data sets from neglected disease screening; crop protection data; drug metabolism and disposition data and bioactivity data from patents. A number of improvements and new features have also been incorporated. These include the annotation of assays and targets using ontologies, the inclusion of targets and indications for clinical candidates, addition of metabolic pathways for drugs and calculation of structural alerts. The ChEMBL data can be accessed via a web-interface, RDF distribution, data downloads and RESTful web-services.

Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

New implications on genomic adaptation derived from the Helicobacter pylori genome comparison.

A large-scale crop protection bioassay data set.

Large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery.

The ChEMBL bioactivity database: an update.

Pharos: Collating protein information to shed light on the druggable genome.

UniChem: a unified chemical structure cross-referencing and identifier tracking system.

Improving the odds of drug development success through human genomics: modelling study.

Actionable druggable genome-wide Mendelian randomization identifies repurposing opportunities for COVID-19.

The ChEMBL Database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periods.

The EBI RDF platform: linked open data for the life sciences.

A viral-human interactome based on structural motif-domain interactions captures the human infectome.

The Global Phosphorylation Landscape of SARS-CoV-2 Infection.

Drug mechanism-of-action discovery through the integration of pharmacological and CRISPR screens.

Open Targets: a platform for therapeutic target identification and validation.

Drug Safety Data Curation and Modeling in ChEMBL: Boxed Warnings and Withdrawn Drugs.

Pranlukast Antagonizes CD49f and Reduces Stemness in Triple-Negative Breast Cancer Cells.

A large-scale dataset of in vivo pharmacology assay results.

ChEMBL: towards direct deposition of bioassay data.

Potential mechanism of action of meso-dihydroguaiaretic acid on Mycobacterium tuberculosis H37Rv.

The ChEMBL database in 2017.

SciCrunch.org Resources

Navigation

Logging in and Registering

Searching

Save Your Search

Query Expansion

Collections

Facets

Options

Further Questions

About

Recent News Entries

Contact Us

SciCrunch

Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

Log in

Log in

Literature

Current Facets and Filters

Options

Facets

Recent searches

.in-collection { color: green; } New implications on genomic adaptation derived from the Helicobacter pylori genome comparison.

.in-collection { color: green; } A large-scale crop protection bioassay data set.

.in-collection { color: green; } Large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery.

.in-collection { color: green; } The ChEMBL bioactivity database: an update.

.in-collection { color: green; } Pharos: Collating protein information to shed light on the druggable genome.

.in-collection { color: green; } UniChem: a unified chemical structure cross-referencing and identifier tracking system.

.in-collection { color: green; } Improving the odds of drug development success through human genomics: modelling study.

.in-collection { color: green; } Actionable druggable genome-wide Mendelian randomization identifies repurposing opportunities for COVID-19.

.in-collection { color: green; } The ChEMBL Database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periods.

.in-collection { color: green; } The EBI RDF platform: linked open data for the life sciences.

.in-collection { color: green; } A viral-human interactome based on structural motif-domain interactions captures the human infectome.

.in-collection { color: green; } The Global Phosphorylation Landscape of SARS-CoV-2 Infection.

.in-collection { color: green; } Drug mechanism-of-action discovery through the integration of pharmacological and CRISPR screens.

.in-collection { color: green; } Open Targets: a platform for therapeutic target identification and validation.

.in-collection { color: green; } Drug Safety Data Curation and Modeling in ChEMBL: Boxed Warnings and Withdrawn Drugs.

.in-collection { color: green; } Pranlukast Antagonizes CD49f and Reduces Stemness in Triple-Negative Breast Cancer Cells.

.in-collection { color: green; } A large-scale dataset of in vivo pharmacology assay results.

.in-collection { color: green; } ChEMBL: towards direct deposition of bioassay data.

.in-collection { color: green; } Potential mechanism of action of meso-dihydroguaiaretic acid on Mycobacterium tuberculosis H37Rv.

.in-collection { color: green; } The ChEMBL database in 2017.

SciCrunch.org Resources

Navigation

Logging in and Registering

Searching

Save Your Search

Query Expansion

Collections

Facets

Options

Further Questions

Publications Per Year

About

Recent News Entries

Contact Us

SciCrunch

New implications on genomic adaptation derived from the Helicobacter pylori genome comparison.

A large-scale crop protection bioassay data set.

Large scale comparison of QSAR and conformal prediction methods and their applications in drug discovery.

The ChEMBL bioactivity database: an update.

Pharos: Collating protein information to shed light on the druggable genome.

UniChem: a unified chemical structure cross-referencing and identifier tracking system.

Improving the odds of drug development success through human genomics: modelling study.

Actionable druggable genome-wide Mendelian randomization identifies repurposing opportunities for COVID-19.

The ChEMBL Database in 2023: a drug discovery platform spanning multiple bioactivity data types and time periods.

The EBI RDF platform: linked open data for the life sciences.

A viral-human interactome based on structural motif-domain interactions captures the human infectome.

The Global Phosphorylation Landscape of SARS-CoV-2 Infection.

Drug mechanism-of-action discovery through the integration of pharmacological and CRISPR screens.

Open Targets: a platform for therapeutic target identification and validation.

Drug Safety Data Curation and Modeling in ChEMBL: Boxed Warnings and Withdrawn Drugs.

Pranlukast Antagonizes CD49f and Reduces Stemness in Triple-Negative Breast Cancer Cells.

A large-scale dataset of in vivo pharmacology assay results.

ChEMBL: towards direct deposition of bioassay data.

Potential mechanism of action of meso-dihydroguaiaretic acid on Mycobacterium tuberculosis H37Rv.

The ChEMBL database in 2017.