FDI Lab - SciCrunch.org | Searching for in Literature

Finding biomedical categories in Medline®.

Lana Yeganova‎ et al.
Journal of biomedical semantics‎
2012‎

There are several humanly defined ontologies relevant to Medline. However, Medline is a fast growing collection of biomedical documents which creates difficulties in updating and expanding these humanly defined ontologies. Automatically identifying meaningful categories of entities in a large text corpus is useful for information extraction, construction of machine learning features, and development of semantic representations. In this paper we describe and compare two methods for automatically learning meaningful biomedical categories in Medline. The first approach is a simple statistical method that uses part-of-speech and frequency information to extract a list of frequent nouns from Medline. The second method implements an alignment-based technique to learn frequent generic patterns that indicate a hyponymy/hypernymy relationship between a pair of noun phrases. We then apply these patterns to Medline to collect frequent hypernyms as potential biomedical categories.

Search results outliers among MEDLINE platforms.

Christopher Sean Burns‎ et al.
Journal of the Medical Library Association : JMLA‎
2019‎

Hypothetically, content in MEDLINE records is consistent across multiple platforms. Though platforms have different interfaces and requirements for query syntax, results should be similar when the syntax is controlled for across the platforms. The authors investigated how search result counts varied when searching records among five MEDLINE platforms.

Has Embase replaced MEDLINE since coverage expansion?

Michael Thomas Lam‎ et al.
Journal of the Medical Library Association : JMLA‎
2018‎

The research tested the authors' hypothesis that more researchers from the academic medicine community in the United States and Canada with institutional access to Embase had started using Embase to replace MEDLINE since Embase was expanded in 2010 to cover all MEDLINE records.

Finding related sentence pairs in MEDLINE.

Larry H Smith‎ et al.
Information retrieval‎
2010‎

We explore the feasibility of automatically identifying sentences in different MEDLINE abstracts that are related in meaning. We compared traditional vector space models with machine learning methods for detecting relatedness, and found that machine learning was superior. The Huber method, a variant of Support Vector Machines which minimizes the modified Huber loss function, achieves 73% precision when the score cutoff is set high enough to identify about one related sentence per abstract on average. We illustrate how an abstract viewed in PubMed might be modified to present the related sentences found in other abstracts by this automatic procedure.

Evaluating relevance ranking strategies for MEDLINE retrieval.

Zhiyong Lu‎ et al.
Journal of the American Medical Informatics Association : JAMIA‎
2009‎

This paper evaluates the retrieval effectiveness of relevance ranking strategies on a collection of 55 queries and about 160,000 MEDLINE((R)) citations used in the 2006 and 2007 Text Retrieval Conference (TREC) Genomics Tracks. The authors study two relevance ranking strategies: term frequency-inverse document frequency (TF-IDF) weighting and sentence-level co-occurrence, and examine their ability to rank retrieved MEDLINE documents given user queries. Furthermore, the authors use the reverse chronological order-PubMed's default display option-as a baseline for comparison. Retrieval effectiveness is assessed using both mean average precision and mean rank precision. Experimental results show that retrievals based on the two strategies had improved performance over the baseline performance, and that TF-IDF weighting is more effective in retrieving relevant documents based on the comparison between the two strategies.

Bibliometric analysis of leishmaniasis research in Medline (1945-2010).

José M Ramos‎ et al.
Parasites & vectors‎
2013‎

Publications are often used as a measure of success of research work. Leishmaniasis is considered endemic in 98 countries, most of which are developing. This article describes a bibliometric review of the literature on leishmaniasis research indexed in PubMed during a 66-year period.

Feature engineering for MEDLINE citation categorization with MeSH.

Antonio Jose Jimeno Yepes‎ et al.
BMC bioinformatics‎
2015‎

Research in biomedical text categorization has mostly used the bag-of-words representation. Other more sophisticated representations of text based on syntactic, semantic and argumentative properties have been less studied. In this paper, we evaluate the impact of different text representations of biomedical texts as features for reproducing the MeSH annotations of some of the most frequent MeSH headings. In addition to unigrams and bigrams, these features include noun phrases, citation meta-data, citation structure, and semantic annotation of the citations.

Search filters to identify geriatric medicine in Medline.

Esther M M van de Glind‎ et al.
Journal of the American Medical Informatics Association : JAMIA‎
2012‎

To create user-friendly search filters with high sensitivity, specificity, and precision to identify articles on geriatric medicine in Medline.

PCorral--interactive mining of protein interactions from MEDLINE.

Chen Li‎ et al.
Database : the journal of biological databases and curation‎
2013‎

The extraction of information from the scientific literature is a complex task-for researchers doing manual curation and for automatic text processing solutions. The identification of protein-protein interactions (PPIs) requires the extraction of protein named entities and their relations. Semi-automatic interactive support is one approach to combine both solutions for efficient working processes to generate reliable database content. In principle, the extraction of PPIs can be achieved with different methods that can be combined to deliver high precision and/or high recall results in different combinations at the same time. Interactive use can be achieved, if the analytical methods are fast enough to process the retrieved documents. PCorral provides interactive mining of PPIs from the scientific literature allowing curators to skim MEDLINE for PPIs at low overheads. The keyword query to PCorral steers the selection of documents, and the subsequent text analysis generates high recall and high precision results for the curator. The underlying components of PCorral process the documents on-the-fly and are available, as well, as web service from the Whatizit infrastructure. The human interface summarizes the identified PPI results, and the involved entities are linked to relevant resources and databases. Altogether, PCorral serves curator at both the beginning and the end of the curation workflow for information retrieval and information extraction. Database URL: http://www.ebi.ac.uk/Rebholz-srv/pcorral.

How to Prepare Endocrinology and Metabolism for Reapplication to MEDLINE.

Sun Huh‎
Endocrinology and metabolism (Seoul, Korea)‎
2017‎

No abstract available

Tools for loading MEDLINE into a local relational database.

Diane E Oliver‎ et al.
BMC bioinformatics‎
2004‎

Researchers who use MEDLINE for text mining, information extraction, or natural language processing may benefit from having a copy of MEDLINE that they can manage locally. The National Library of Medicine (NLM) distributes MEDLINE in eXtensible Markup Language (XML)-formatted text files, but it is difficult to query MEDLINE in that format. We have developed software tools to parse the MEDLINE data files and load their contents into a relational database. Although the task is conceptually straightforward, the size and scope of MEDLINE make the task nontrivial. Given the increasing importance of text analysis in biology and medicine, we believe a local installation of MEDLINE will provide helpful computing infrastructure for researchers.

Wide-coverage relation extraction from MEDLINE using deep syntax.

Nhung T H Nguyen‎ et al.
BMC bioinformatics‎
2015‎

Relation extraction is a fundamental technology in biomedical text mining. Most of the previous studies on relation extraction from biomedical literature have focused on specific or predefined types of relations, which inherently limits the types of the extracted relations. With the aim of fully leveraging the knowledge described in the literature, we address much broader types of semantic relations using a single extraction framework.

CoPub Mapper: mining MEDLINE based on search term co-publication.

Blaise T F Alako‎ et al.
BMC bioinformatics‎
2005‎

High throughput microarray analyses result in many differentially expressed genes that are potentially responsible for the biological process of interest. In order to identify biological similarities between genes, publications from MEDLINE were identified in which pairs of gene names and combinations of gene name with specific keywords were co-mentioned.

DigChem: Identification of disease-gene-chemical relationships from Medline abstracts.

Jeongkyun Kim‎ et al.
PLoS computational biology‎
2019‎

Chemicals interact with genes in the process of disease development and treatment. Although much biomedical research has been performed to understand relationships among genes, chemicals, and diseases, which have been reported in biomedical articles in Medline, there are few studies that extract disease-gene-chemical relationships from biomedical literature at a PubMed scale. In this study, we propose a deep learning model based on bidirectional long short-term memory to identify the evidence sentences of relationships among genes, chemicals, and diseases from Medline abstracts. Then, we develop the search engine DigChem to enable disease-gene-chemical relationship searches for 35,124 genes, 56,382 chemicals, and 5,675 diseases. We show that the identified relationships are reliable by comparing them with manual curation and existing databases. DigChem is available at http://gcancer.org/digchem.

Identifying observational studies of surgical interventions in MEDLINE and EMBASE.

Cynthia Fraser‎ et al.
BMC medical research methodology‎
2006‎

Health technology assessments of surgical interventions frequently require the inclusion of non-randomised evidence. Literature search strategies employed to identify this evidence often exclude a methodological component because of uncertainty surrounding the use of appropriate search terms. This can result in the retrieval of a large number of irrelevant records. Methodological filters would help to minimise this, making literature searching more efficient.

Correcting duplicate publications: follow up study of MEDLINE tagged duplications.

Mario Malički‎ et al.
Biochemia medica‎
2019‎

As MEDLINE indexers tag similar articles as duplicates even when journals have not addressed the duplication(s), we sought to determine the reasons behind the tagged duplications, and if the journals had undertaken or had planned to undertake any actions to address them.

An analysis of disease-gene relationship from Medline abstracts by DigSee.

Jeongkyun Kim‎ et al.
Scientific reports‎
2017‎

Diseases are developed by abnormal behavior of genes in biological events such as gene regulation, mutation, phosphorylation, and epigenetics and post-translational modification. Many studies of text mining attempted to identify the relationship between gene and disease by mining the literature, but they did not consider the biological events in which genes show abnormal behaviour in response to diseases. In this study, we propose to identify disease-related genes that are involved in the development of disease through biological events from Medline abstracts. We identified associations between 13,054 genes and 4,494 disease types, which cover more disease-related genes than manually curated databases for all disease types (e.g., Online Mendelian Inheritance in Man) and also than those for specific diseases (e.g., Alzheimer's disease and hypertension). We show that the text mining findings are reliable, as per the PubMed scale, in that the disease-disease relationships inferred from the literature-wide findings are similar to those inferred from manually curated databases in a well-known study. In addition, literature-wide distribution of biological events across disease types reveals different characteristics of disease types.

PRIORI-T: A tool for rare disease gene prioritization using MEDLINE.

Aditya Rao‎ et al.
PloS one‎
2020‎

Phenotype-driven rare disease gene prioritization relies on high quality curated resources containing disease, gene and phenotype annotations. However, the effectiveness of gene prioritization tools is constrained by the incomplete coverage of rare disease, phenotype and gene annotations in such curated resources.

A novel algorithm for analyzing drug-drug interactions from MEDLINE literature.

Yin Lu‎ et al.
Scientific reports‎
2015‎

Drug-drug interaction (DDI) is becoming a serious clinical safety issue as the use of multiple medications becomes more common. Searching the MEDLINE database for journal articles related to DDI produces over 330,000 results. It is impossible to read and summarize these references manually. As the volume of biomedical reference in the MEDLINE database continues to expand at a rapid pace, automatic identification of DDIs from literature is becoming increasingly important. In this article, we present a random-sampling-based statistical algorithm to identify possible DDIs and the underlying mechanism from the substances field of MEDLINE records. The substances terms are essentially carriers of compound (including protein) information in a MEDLINE record. Four case studies on warfarin, ibuprofen, furosemide and sertraline implied that our method was able to rank possible DDIs with high accuracy (90.0% for warfarin, 83.3% for ibuprofen, 70.0% for furosemide and 100% for sertraline in the top 10% of a list of compounds ranked by p-value). A social network analysis of substance terms was also performed to construct networks between proteins and drug pairs to elucidate how the two drugs could interact.

Research articles on volunteering in biomedical journals: a MEDLINE-based bibliometric analysis.

Bo-Ren Cheng‎ et al.
The Journal of international medical research‎
2020‎

Volunteering, an important aspect of society, is a multidisciplinary issue. However, there few are extensive surveys examining the trend and scope of research on volunteering in biomedical areas.

Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

Finding biomedical categories in Medline®.

Search results outliers among MEDLINE platforms.

Has Embase replaced MEDLINE since coverage expansion?

Finding related sentence pairs in MEDLINE.

Evaluating relevance ranking strategies for MEDLINE retrieval.

Bibliometric analysis of leishmaniasis research in Medline (1945-2010).

Feature engineering for MEDLINE citation categorization with MeSH.

Search filters to identify geriatric medicine in Medline.

PCorral--interactive mining of protein interactions from MEDLINE.

How to Prepare Endocrinology and Metabolism for Reapplication to MEDLINE.

Tools for loading MEDLINE into a local relational database.

Wide-coverage relation extraction from MEDLINE using deep syntax.

CoPub Mapper: mining MEDLINE based on search term co-publication.

DigChem: Identification of disease-gene-chemical relationships from Medline abstracts.

Identifying observational studies of surgical interventions in MEDLINE and EMBASE.

Correcting duplicate publications: follow up study of MEDLINE tagged duplications.

An analysis of disease-gene relationship from Medline abstracts by DigSee.

PRIORI-T: A tool for rare disease gene prioritization using MEDLINE.

A novel algorithm for analyzing drug-drug interactions from MEDLINE literature.

Research articles on volunteering in biomedical journals: a MEDLINE-based bibliometric analysis.

SciCrunch.org Resources

Navigation

Logging in and Registering

Searching

Save Your Search

Query Expansion

Collections

Facets

Options

Further Questions

About

Recent News Entries

Contact Us

SciCrunch

Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

Log in

Log in

Literature

Options

Facets

Recent searches

.in-collection { color: green; } Finding biomedical categories in Medline®.

.in-collection { color: green; } Search results outliers among MEDLINE platforms.

.in-collection { color: green; } Has Embase replaced MEDLINE since coverage expansion?

.in-collection { color: green; } Finding related sentence pairs in MEDLINE.

.in-collection { color: green; } Evaluating relevance ranking strategies for MEDLINE retrieval.

.in-collection { color: green; } Bibliometric analysis of leishmaniasis research in Medline (1945-2010).

.in-collection { color: green; } Feature engineering for MEDLINE citation categorization with MeSH.

.in-collection { color: green; } Search filters to identify geriatric medicine in Medline.

.in-collection { color: green; } PCorral--interactive mining of protein interactions from MEDLINE.

.in-collection { color: green; } How to Prepare Endocrinology and Metabolism for Reapplication to MEDLINE.

.in-collection { color: green; } Tools for loading MEDLINE into a local relational database.

.in-collection { color: green; } Wide-coverage relation extraction from MEDLINE using deep syntax.

.in-collection { color: green; } CoPub Mapper: mining MEDLINE based on search term co-publication.

.in-collection { color: green; } DigChem: Identification of disease-gene-chemical relationships from Medline abstracts.

.in-collection { color: green; } Identifying observational studies of surgical interventions in MEDLINE and EMBASE.

.in-collection { color: green; } Correcting duplicate publications: follow up study of MEDLINE tagged duplications.

.in-collection { color: green; } An analysis of disease-gene relationship from Medline abstracts by DigSee.

.in-collection { color: green; } PRIORI-T: A tool for rare disease gene prioritization using MEDLINE.

.in-collection { color: green; } A novel algorithm for analyzing drug-drug interactions from MEDLINE literature.

.in-collection { color: green; } Research articles on volunteering in biomedical journals: a MEDLINE-based bibliometric analysis.

SciCrunch.org Resources

Navigation

Logging in and Registering

Searching

Save Your Search

Query Expansion

Collections

Facets

Options

Further Questions

Publications Per Year

About

Recent News Entries

Contact Us

SciCrunch

Finding biomedical categories in Medline®.

Search results outliers among MEDLINE platforms.

Has Embase replaced MEDLINE since coverage expansion?

Finding related sentence pairs in MEDLINE.

Evaluating relevance ranking strategies for MEDLINE retrieval.

Bibliometric analysis of leishmaniasis research in Medline (1945-2010).

Feature engineering for MEDLINE citation categorization with MeSH.

Search filters to identify geriatric medicine in Medline.

PCorral--interactive mining of protein interactions from MEDLINE.

How to Prepare Endocrinology and Metabolism for Reapplication to MEDLINE.

Tools for loading MEDLINE into a local relational database.

Wide-coverage relation extraction from MEDLINE using deep syntax.

CoPub Mapper: mining MEDLINE based on search term co-publication.

DigChem: Identification of disease-gene-chemical relationships from Medline abstracts.

Identifying observational studies of surgical interventions in MEDLINE and EMBASE.

Correcting duplicate publications: follow up study of MEDLINE tagged duplications.

An analysis of disease-gene relationship from Medline abstracts by DigSee.

PRIORI-T: A tool for rare disease gene prioritization using MEDLINE.

A novel algorithm for analyzing drug-drug interactions from MEDLINE literature.

Research articles on volunteering in biomedical journals: a MEDLINE-based bibliometric analysis.