Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

This service exclusively searches for literature that cites resources. Please be aware that the total number of searchable documents is limited to those containing RRIDs and does not include all open-access literature.

Search

Type in a keyword to search

On page 1 showing 1 ~ 20 papers out of 289 papers

Updated restraint dictionaries for carbohydrates in the pyranose form.

  • Mihaela Atanasova‎ et al.
  • Acta crystallographica. Section D, Structural biology‎
  • 2022‎

Restraint dictionaries are used during macromolecular structure refinement to encapsulate intramolecular connectivity and geometric information. These dictionaries allow previously determined `ideal' values of features such as bond lengths, angles and torsions to be used as restraint targets. During refinement, restraints influence the model to adopt a conformation that agrees with prior observation. This is especially important when refining crystal structures of glycosylated proteins, as their resolutions tend to be worse than those of nonglycosylated proteins. Pyranosides, the overwhelming majority component in all forms of protein glycosylation, often display conformational errors in crystal structures. Whilst many of these flaws usually relate to model building, refinement issues may also have their root in suboptimal restraint dictionaries. In order to avoid subsequent misinterpretation and to improve the quality of all pyranose monosaccharide entries in the CCP4 Monomer Library, new dictionaries with improved ring torsion restraints, coordinates reflecting the lowest-energy ring pucker and updated geometry have been produced and evaluated. These new dictionaries are now part of the CCP4 Monomer Library and will be released with CCP4 version 8.0.


Constructing Japanese MeSH term dictionaries related to the COVID-19 literature.

  • Atsuko Yamaguchi‎ et al.
  • Genomics & informatics‎
  • 2021‎

The coronavirus disease 2019 (COVID-19) pandemic has led to a flood of research papers and the information has been updated with considerable frequency. For society to derive benefits from this research, it is necessary to promote sharing up-to-date knowledge from these papers. However, because most research papers are written in English, it is difficult for people who are not familiar with English medical terms to obtain knowledge from them. To facilitate sharing knowledge from COVID-19 papers written in English for Japanese speakers, we tried to construct a dictionary with an open license by assigning Japanese terms to MeSH unique identifiers (UIDs) annotated to words in the texts of COVID-19 papers. Using this dictionary, 98.99% of all occurrences of MeSH terms in COVID-19 papers were covered. We also created a curated version of the dictionary and uploaded it to PubDictionary for wider use in the PubAnnotation system.


A document processing pipeline for annotating chemical entities in scientific documents.

  • David Campos‎ et al.
  • Journal of cheminformatics‎
  • 2015‎

The recognition of drugs and chemical entities in text is a very important task within the field of biomedical information extraction, given the rapid growth in the amount of published texts (scientific papers, patents, patient records) and the relevance of these and other related concepts. If done effectively, this could allow exploiting such textual resources to automatically extract or infer relevant information, such as drug profiles, relations and similarities between drugs, or associations between drugs and potential drug targets. The objective of this work was to develop and validate a document processing and information extraction pipeline for the identification of chemical entity mentions in text.


In-silico guided chemical exploration of KDM4A fragments hits.

  • Jessica Lombino‎ et al.
  • Clinical epigenetics‎
  • 2023‎

Lysine demethylase enzymes (KDMs) are an emerging class of therapeutic targets, that catalyse the removal of methyl marks from histone lysine residues regulating chromatin structure and gene expression. KDM4A isoform plays an important role in the epigenetic dysregulation in various cancers and is linked to aggressive disease and poor clinical outcomes. Despite several efforts, the KDM4 family lacks successful specific molecular inhibitors.


Chemical entity recognition in patents by combining dictionary-based and statistical approaches.

  • Saber A Akhondi‎ et al.
  • Database : the journal of biological databases and curation‎
  • 2016‎

We describe the development of a chemical entity recognition system and its application in the CHEMDNER-patent track of BioCreative 2015. This community challenge includes a Chemical Entity Mention in Patents (CEMP) recognition task and a Chemical Passage Detection (CPD) classification task. We addressed both tasks by an ensemble system that combines a dictionary-based approach with a statistical one. For this purpose the performance of several lexical resources was assessed using Peregrine, our open-source indexing engine. We combined our dictionary-based results on the patent corpus with the results of tmChem, a chemical recognizer using a conditional random field classifier. To improve the performance of tmChem, we utilized three additional features, viz. part-of-speech tags, lemmas and word-vector clusters. When evaluated on the training data, our final system obtained an F-score of 85.21% for the CEMP task, and an accuracy of 91.53% for the CPD task. On the test set, the best system ranked sixth among 21 teams for CEMP with an F-score of 86.82%, and second among nine teams for CPD with an accuracy of 94.23%. The differences in performance between the best ensemble system and the statistical system separately were small.Database URL: http://biosemantics.org/chemdner-patents.


Chemical named entity recognition in patents by domain knowledge and unsupervised feature learning.

  • Yaoyun Zhang‎ et al.
  • Database : the journal of biological databases and curation‎
  • 2016‎

Medicinal chemistry patents contain rich information about chemical compounds. Although much effort has been devoted to extracting chemical entities from scientific literature, limited numbers of patent mining systems are publically available, probably due to the lack of large manually annotated corpora. To accelerate the development of information extraction systems for medicinal chemistry patents, the 2015 BioCreative V challenge organized a track on Chemical and Drug Named Entity Recognition from patent text (CHEMDNER patents). This track included three individual subtasks: (i) Chemical Entity Mention Recognition in Patents (CEMP), (ii) Chemical Passage Detection (CPD) and (iii) Gene and Protein Related Object task (GPRO). We participated in the two subtasks of CEMP and CPD using machine learning-based systems. Our machine learning-based systems employed the algorithms of conditional random fields (CRF) and structured support vector machines (SSVMs), respectively. To improve the performance of the NER systems, two strategies were proposed for feature engineering: (i) domain knowledge features of dictionaries, chemical structural patterns and semantic type information present in the context of the candidate chemical and (ii) unsupervised feature learning algorithms to generate word representation features by Brown clustering and a novel binarized Word embedding to enhance the generalizability of the system. Further, the system output for the CPD task was yielded based on the patent titles and abstracts with chemicals recognized in the CEMP task.The effects of the proposed feature strategies on both the machine learning-based systems were investigated. Our best system achieved the second best performance among 21 participating teams in CEMP with a precision of 87.18%, a recall of 90.78% and aF-measure of 88.94% and was the top performing system among nine participating teams in CPD with a sensitivity of 98.60%, a specificity of 87.21%, an accuracy of 94.75%, a Matthew's correlation coefficient (MCC) of 88.24%, a precision at full recall (P_full_R) of 66.57% and an area under the precision-recall curve (AUC_PR) of 0.9347. The SSVM-based CEMP systems outperformed the CRF-based CEMP systems when using the same features. Features generated from both the domain knowledge and unsupervised learning algorithms significantly improved the chemical NER task on patents.Database URL:http:// database. oxfordjournals. org/ content/ 2016/ baw049.


CheNER: a tool for the identification of chemical entities and their classes in biomedical literature.

  • Anabel Usié‎ et al.
  • Journal of cheminformatics‎
  • 2015‎

Small chemical molecules regulate biological processes at the molecular level. Those molecules are often involved in causing or treating pathological states. Automatically identifying such molecules in biomedical text is difficult due to both, the diverse morphology of chemical names and the alternative types of nomenclature that are simultaneously used to describe them. To address these issues, the last BioCreAtIvE challenge proposed a CHEMDNER task, which is a Named Entity Recognition (NER) challenge that aims at labelling different types of chemical names in biomedical text.


An Improved Receptor-Based Pharmacophore Generation Algorithm Guided by Atomic Chemical Characteristics and Hybridization Types.

  • Gaoqi He‎ et al.
  • Frontiers in pharmacology‎
  • 2018‎

Pharmacophore-based virtual screening is an important and leading compound discovery method. However, current pharmacophore generation algorithms suffer from difficulties, such as ligand-dependent computation and massive extractive chemical features. On the basis of the features extracted by the five probes in Pocket v.3, this paper presents an improved receptor-based pharmacophore generation algorithm guided by atomic chemical characteristics and hybridization types. The algorithm works under the constraint of receptor atom hybridization types and space distance. Four chemical characteristics (H-A, H-D, and positive and negative charges) were extracted using the hybridization type of receptor atoms, and the feature point sets were merged with 3 Å space constraints. Furthermore, on the basis of the original extraction of hydrophobic characteristics, extraction of aromatic ring chemical characteristics was achieved by counting the number of aromatics, searching for residual base aromatic ring, and determining the direction of aromatic rings. Accordingly, extraction of six kinds of chemical characteristics of the pharmacophore was achieved. In view of the pharmacophore characteristics, our algorithm was compared with the existing LigandScout algorithm. The results demonstrate that the pharmacophore possessing six chemical characteristics can be characterized using our algorithm, which features fewer pharmacophore characteristics and is ligand independent. The computation of many instances from the directory of useful decoy dataset show that the active molecules and decoy molecules can be effectively differentiated through the presented method in this paper.


Automatic vs. manual curation of a multi-source chemical dictionary: the impact on text mining.

  • Kristina M Hettne‎ et al.
  • Journal of cheminformatics‎
  • 2010‎

Previously, we developed a combined dictionary dubbed Chemlist for the identification of small molecules and drugs in text based on a number of publicly available databases and tested it on an annotated corpus. To achieve an acceptable recall and precision we used a number of automatic and semi-automatic processing steps together with disambiguation rules. However, it remained to be investigated which impact an extensive manual curation of a multi-source chemical dictionary would have on chemical term identification in text. ChemSpider is a chemical database that has undergone extensive manual curation aimed at establishing valid chemical name-to-structure relationships.


Next-generation text-mining mediated generation of chemical response-specific gene sets for interpretation of gene expression data.

  • Kristina M Hettne‎ et al.
  • BMC medical genomics‎
  • 2013‎

Availability of chemical response-specific lists of genes (gene sets) for pharmacological and/or toxic effect prediction for compounds is limited. We hypothesize that more gene sets can be created by next-generation text mining (next-gen TM), and that these can be used with gene set analysis (GSA) methods for chemical treatment identification, for pharmacological mechanism elucidation, and for comparing compound toxicity profiles.


Assessing the Effectiveness of Chemical Marker Extraction from Amazonian Plant Cupuassu (Theobroma grandiflorum) by PSI-HRMS/MS and LC-HRMS/MS.

  • Nerilson M Lima‎ et al.
  • Metabolites‎
  • 2023‎

Employing a combination of liquid chromatography electrospray ionization and paper spray ionization high-resolution tandem mass spectrometry, extracts from cupuassu (Theobroma grandiflorum) pulp prepared with either water, methanol, acetonitrile or combinations thereof were subjected to metabolite fingerprinting. Among the tested extractors, 100% methanol extracted preferentially phenols and cinnamic acids derivatives, whereas acetonitrile and acetonitrile/methanol were more effective in extracting terpenoids and flavonoids, respectively. And while liquid chromatography- mass spectrometry detected twice as many metabolites as paper spray ionization tandem mass spectrometry, the latter proved its potential as a screening technique. Comprehensive structural annotation showed a high production of terpenes, mainly oleanane triterpene derivatives. of the mass spectra Further, five major metabolites with known antioxidant activity, namely catechin, citric acid, epigallocatechin-3'-glucuronide, 5,7,8-trihydroxyflavanone, and asiatic acid, were subjected to molecular docking analysis using the antioxidative enzyme peroxiredoxin 5 (PRDX5) as a model receptor. Based on its excellent docking score, a pharmacophore model of 5,7,8-trihydroxyflavanone was generated, which may help the design of new antioxidants.


An Informatics Approach to Evaluating Combined Chemical Exposures from Consumer Products: A Case Study of Asthma-Associated Chemicals and Potential Endocrine Disruptors.

  • Henry A Gabb‎ et al.
  • Environmental health perspectives‎
  • 2016‎

Simultaneous or sequential exposure to multiple environmental stressors can affect chemical toxicity. Cumulative risk assessments consider multiple stressors but it is impractical to test every chemical combination to which people are exposed. New methods are needed to prioritize chemical combinations based on their prevalence and possible health impacts.


Leveraging word embeddings and medical entity extraction for biomedical dataset retrieval using unstructured texts.

  • Yanshan Wang‎ et al.
  • Database : the journal of biological databases and curation‎
  • 2017‎

The recent movement towards open data in the biomedical domain has generated a large number of datasets that are publicly accessible. The Big Data to Knowledge data indexing project, biomedical and healthCAre Data Discovery Index Ecosystem (bioCADDIE), has gathered these datasets in a one-stop portal aiming at facilitating their reuse for accelerating scientific advances. However, as the number of biomedical datasets stored and indexed increases, it becomes more and more challenging to retrieve the relevant datasets according to researchers' queries. In this article, we propose an information retrieval (IR) system to tackle this problem and implement it for the bioCADDIE Dataset Retrieval Challenge. The system leverages the unstructured texts of each dataset including the title and description for the dataset, and utilizes a state-of-the-art IR model, medical named entity extraction techniques, query expansion with deep learning-based word embeddings and a re-ranking strategy to enhance the retrieval performance. In empirical experiments, we compared the proposed system with 11 baseline systems using the bioCADDIE Dataset Retrieval Challenge datasets. The experimental results show that the proposed system outperforms other systems in terms of inference Average Precision and inference normalized Discounted Cumulative Gain, implying that the proposed system is a viable option for biomedical dataset retrieval. Database URL: https://github.com/yanshanwang/biocaddie2016mayodata.


Comparison of concept recognizers for building the Open Biomedical Annotator.

  • Nigam H Shah‎ et al.
  • BMC bioinformatics‎
  • 2009‎

The National Center for Biomedical Ontology (NCBO) is developing a system for automated, ontology-based access to online biomedical resources (Shah NH, et al.: Ontology-driven indexing of public datasets for translational bioinformatics. BMC Bioinformatics 2009, 10(Suppl 2):S1). The system's indexing workflow processes the text metadata of diverse resources such as datasets from GEO and ArrayExpress to annotate and index them with concepts from appropriate ontologies. This indexing requires the use of a concept-recognition tool to identify ontology concepts in the resource's textual metadata. In this paper, we present a comparison of two concept recognizers - NLM's MetaMap and the University of Michigan's Mgrep. We utilize a number of data sources and dictionaries to evaluate the concept recognizers in terms of precision, recall, speed of execution, scalability and customizability. Our evaluations demonstrate that Mgrep has a clear edge over MetaMap for large-scale service oriented applications. Based on our analysis we also suggest areas of potential improvements for Mgrep. We have subsequently used Mgrep to build the Open Biomedical Annotator service. The Annotator service has access to a large dictionary of biomedical terms derived from the United Medical Language System (UMLS) and NCBO ontologies. The Annotator also leverages the hierarchical structure of the ontologies and their mappings to expand annotations. The Annotator service is available to the community as a REST Web service for creating ontology-based annotations of their data.


A text mining approach to detect mentions of protein glycosylation in biomedical text.

  • Daksha Shukla‎ et al.
  • Bioinformation‎
  • 2012‎

Protein Glycosylation is an important post translational event that plays a pivotal role in protein folding and protein is trafficking. We describe a dictionary based and a rule based approach to mine 'mentions' of protein glycosylation in text. The dictionary based approach relies on a set of manually curated dictionaries specially constructed to address this task. Abstracts are then screened for the 'mentions' of words from these dictionaries which are further scored followed by classification on the basis of a threshold. The rule based approaches also relies on the words in the dictionary to arrive at the features which are used for classification. The performance of the system using both the approaches has been evaluated using a manually curated corpus of 3133 abstracts. The evaluation suggests that the performance of the Rule based approach supersedes that of the Dictionary based approach.


DESTAF: a database of text-mined associations for reproductive toxins potentially affecting human fertility.

  • Adam S Dawe‎ et al.
  • Reproductive toxicology (Elmsford, N.Y.)‎
  • 2012‎

The Dragon Exploration System for Toxicants and Fertility (DESTAF) is a publicly available resource which enables researchers to efficiently explore both known and potentially novel information and associations in the field of reproductive toxicology. To create DESTAF we used data from the literature (including over 10500 PubMed abstracts), several publicly available biomedical repositories, and specialized, curated dictionaries. DESTAF has an interface designed to facilitate rapid assessment of the key associations between relevant concepts, allowing for a more in-depth exploration of information based on different gene/protein-, enzyme/metabolite-, toxin/chemical-, disease- or anatomically centric perspectives. As a special feature, DESTAF allows for the creation and initial testing of potentially new association hypotheses that suggest links between biological entities identified through the database. DESTAF, along with a PDF manual, can be found at http://cbrc.kaust.edu.sa/destaf. It is free to academic and non-commercial users and will be updated quarterly.


The current understanding of knowledge management concepts: A critical review.

  • Shahram Yazdani‎ et al.
  • Medical journal of the Islamic Republic of Iran‎
  • 2020‎

Background: Higher education institutions include experts who are knowledgeable. Knowledge management facilitates institutions to enhance the capacity to collect information and knowledge and apply it to problem-solving and decision making. Through the review of related studies, we observed that there are multiple concepts and terms in the field of knowledge management. Thus, the complexity and variety of these concepts and definitions must be clarified. Considering the importance of clarifying these concepts for utilization by users, this study aimed to examine the concepts related to this filed. Methods: The methodology used in this study was based on the Carnwell and Daly's critical review method. An extensive search was carried out on various databases and libraries. A critical and profound review was carried out on selected articles. Many wandering concepts were found. Identified concepts were classified into seven categories based on conceptual proximity. Existing definitions and evidence in relation to extracted concepts were criticized and synthesized. The definitional attributes for them were identified and a conceptual identity card was provided for each of the concepts. Results: Thirty-seven concepts with the most relevance to the field of knowledge management were extracted. There was no clear boundary among them, and they wandered. To avoid more confusion, concepts were classified according to semantic relation. Eight categories were created; each category consisted of a mother concept and several other concepts with similarity and proximity to the meaning of the original concept. Their attributes have been identified, and finally, each of them was presented in the form of a conceptual identity card. Conclusion: Through critically reviewing the literature in this field, we were able to identify the concepts and realize their attributes. In this way, we came to a new interpretation of the concepts. At the end of the study, we concluded that some of the concepts have not been properly defined and are not properly located in the knowledge management field; also their application is uncertain.


Taking the responsibility in dementia care: A concept analysis about facticity.

  • Célia Pereira Caldas‎ et al.
  • Nursing open‎
  • 2018‎

The aim of this study is to develop a comprehensive definition of facticity, applicable to dementia nursing.


The Quixote project: Collaborative and Open Quantum Chemistry data management in the Internet age.

  • Sam Adams‎ et al.
  • Journal of cheminformatics‎
  • 2011‎

Computational Quantum Chemistry has developed into a powerful, efficient, reliable and increasingly routine tool for exploring the structure and properties of small to medium sized molecules. Many thousands of calculations are performed every day, some offering results which approach experimental accuracy. However, in contrast to other disciplines, such as crystallography, or bioinformatics, where standard formats and well-known, unified databases exist, this QC data is generally destined to remain locally held in files which are not designed to be machine-readable. Only a very small subset of these results will become accessible to the wider community through publication.In this paper we describe how the Quixote Project is developing the infrastructure required to convert output from a number of different molecular quantum chemistry packages to a common semantically rich, machine-readable format and to build respositories of QC results. Such an infrastructure offers benefits at many levels. The standardised representation of the results will facilitate software interoperability, for example making it easier for analysis tools to take data from different QC packages, and will also help with archival and deposition of results. The repository infrastructure, which is lightweight and built using Open software components, can be implemented at individual researcher, project, organisation or community level, offering the exciting possibility that in future many of these QC results can be made publically available, to be searched and interpreted just as crystallography and bioinformatics results are today.Although we believe that quantum chemists will appreciate the contribution the Quixote infrastructure can make to the organisation and and exchange of their results, we anticipate that greater rewards will come from enabling their results to be consumed by a wider community. As the respositories grow they will become a valuable source of chemical data for use by other disciplines in both research and education.The Quixote project is unconventional in that the infrastructure is being implemented in advance of a full definition of the data model which will eventually underpin it. We believe that a working system which offers real value to researchers based on tools and shared, searchable repositories will encourage early participation from a broader community, including both producers and consumers of data. In the early stages, searching and indexing can be performed on the chemical subject of the calculations, and well defined calculation meta-data. The process of defining more specific quantum chemical definitions, adding them to dictionaries and extracting them consistently from the results of the various software packages can then proceed in an incremental manner, adding additional value at each stage.Not only will these results help to change the data management model in the field of Quantum Chemistry, but the methodology can be applied to other pressing problems related to data in computational and experimental science.


Rational design of small molecules able to inhibit α-synuclein amyloid aggregation for the treatment of Parkinson's disease.

  • Serena Vittorio‎ et al.
  • Journal of enzyme inhibition and medicinal chemistry‎
  • 2020‎

Parkinson's disease is one of the most common neurodegenerative disorders in elderly age. One of the mechanisms involved in the neurodegeneration appears related to the aggregation of the presynaptic protein alpha synuclein (α-syn) into toxic oligomers and fibrils. To date, no highly effective treatment is currently available; therefore, there is an increasing interest in the search of new therapeutic tools. The modulation of α-syn aggregation represents an emergent and promising disease-modifying strategy for reducing or blocking the neurodegenerative process. Herein, by combining in silico and in vitro screenings we initially identified 3-(cinnamylsulfanyl)-5-(4-pyridinyl)-1,2,4-triazol-4-amine (3) as α-syn aggregation inhibitor that was then considered a promising hit for the further design of a new series of small molecules. Therefore, we rationally designed new hit-derivatives that were synthesised and evaluated by biological assays. Lastly, the binding mode of the newer inhibitors was predicted by docking studies.


  1. SciCrunch.org Resources

    Welcome to the FDI Lab - SciCrunch.org Resources search. From here you can search through a compilation of resources used by FDI Lab - SciCrunch.org and see how data is organized within our community.

  2. Navigation

    You are currently on the Community Resources tab looking through categories and sources that FDI Lab - SciCrunch.org has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.

  3. Logging in and Registering

    If you have an account on FDI Lab - SciCrunch.org then you can log in from here to get additional features in FDI Lab - SciCrunch.org such as Collections, Saved Searches, and managing Resources.

  4. Searching

    Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:

    1. Use quotes around phrases you want to match exactly
    2. You can manually AND and OR terms to change how we search between words
    3. You can add "-" to terms to make sure no results return with that term in them (ex. Cerebellum -CA1)
    4. You can add "+" to terms to require they be in the data
    5. Using autocomplete specifies which branch of our semantics you with to search and can help refine your search
  5. Save Your Search

    You can save any searches you perform for quick access to later from here.

  6. Query Expansion

    We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.

  7. Collections

    If you are logged into FDI Lab - SciCrunch.org you can add data records to your collections to create custom spreadsheets across multiple sources of data.

  8. Facets

    Here are the facets that you can filter your papers by.

  9. Options

    From here we'll present any options for the literature, such as exporting your current results.

  10. Further Questions

    If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.

Publications Per Year

X

Year:

Count: