metabolicMine: an integrated genomics, genetics and proteomics data warehouse for common metabolic disease research.

Database : the journal of biological databases and curation | 2013

Common metabolic and endocrine diseases such as diabetes affect millions of people worldwide and have a major health impact, frequently leading to complications and mortality. In a search for better prevention and treatment, there is ongoing research into the underlying molecular and genetic bases of these complex human diseases, as well as into the links with risk factors such as obesity. Although an increasing number of relevant genomic and proteomic data sets have become available, the quantity and diversity of the data make their efficient exploitation challenging. Here, we present metabolicMine, a data warehouse with a specific focus on the genomics, genetics and proteomics of common metabolic diseases. Developed in collaboration with leading UK metabolic disease groups, metabolicMine integrates data sets from a range of experiments and model organisms alongside tools for exploring them. The current version brings together information covering genes, proteins, orthologues, interactions, gene expression, pathways, ontologies, diseases, genome-wide association studies and single nucleotide polymorphisms. Although the emphasis is on human data, key data sets from mouse and rat are included. These are complemented by interoperation with the RatMine rat genomics database, with a corresponding mouse version under development by the Mouse Genome Informatics (MGI) group. The web interface contains a number of features including keyword search, a library of Search Forms, the QueryBuilder and list analysis tools. This provides researchers with many different ways to analyse, view and flexibly export data. Programming interfaces and automatic code generation in several languages are supported, and many of the features of the web interface are available through web services. The combination of diverse data sets integrated with analysis tools and a powerful query system makes metabolicMine a valuable research resource. 
The web interface makes it accessible to first-time users, whereas the Application Programming Interface (API) and web services provide convenient data access and tools for bioinformaticians. metabolicMine is freely available online at http://www.metabolicmine.org. Database URL: http://www.metabolicmine.org
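As a rough illustration of the web-service access the abstract describes, the sketch below builds a query URL without sending it. It assumes metabolicMine exposes InterMine-style web services, where a query is an XML document passed to a `query/results` endpoint. The service root, the `Gene.symbol` and `Gene.primaryIdentifier` paths, and the helper names are hypothetical placeholders, not details taken from the publication.

```python
from urllib.parse import urlencode
from xml.sax.saxutils import quoteattr


def build_query_xml(view, path, op, value, model="genomic"):
    """Build a small InterMine-style query XML string with one constraint.

    The XML layout is an assumption based on InterMine conventions.
    """
    return (
        f"<query model={quoteattr(model)} view={quoteattr(' '.join(view))}>"
        f"<constraint path={quoteattr(path)} op={quoteattr(op)} value={quoteattr(value)}/>"
        "</query>"
    )


def build_results_url(service_root, query_xml, fmt="tab"):
    """Return a URL for the (assumed) query/results endpoint; no request is sent."""
    params = urlencode({"query": query_xml, "format": fmt})
    return f"{service_root.rstrip('/')}/query/results?{params}"


# Hypothetical service root: the real path may differ.
SERVICE_ROOT = "http://www.metabolicmine.org/service"

xml = build_query_xml(
    view=["Gene.symbol", "Gene.primaryIdentifier"],  # assumed column paths
    path="Gene.symbol",
    op="=",
    value="PPARG",
)
url = build_results_url(SERVICE_ROOT, xml)
print(url)
```

Keeping URL construction separate from the request makes the query easy to inspect before any data is fetched; the resulting URL could then be passed to any HTTP client.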

PubMed ID: 23935057

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


Ensembl (tool)

RRID:SCR_002344

Collection of genome databases for vertebrates and other eukaryotic species with DNA and protein sequence search capabilities. Used to automatically annotate genomes, integrate this annotation with other available biological data and make the data publicly available via the web. Ensembl tools include BLAST, BLAT, BioMart and the Variant Effect Predictor (VEP) for all supported species.

UniProt (tool)

RRID:SCR_002380

Collection of data of protein sequence and functional information. Resource for protein sequence and annotation data. Consortium for preservation of the UniProt databases: UniProt Knowledgebase (UniProtKB), UniProt Reference Clusters (UniRef), and UniProt Archive (UniParc), UniProt Proteomes. Collaboration between European Bioinformatics Institute (EMBL-EBI), SIB Swiss Institute of Bioinformatics and Protein Information Resource. Swiss-Prot is a curated subset of UniProtKB.

Entrez Gene (tool)

RRID:SCR_002473

Database for genomes that have been completely sequenced, have active research community to contribute gene-specific information, or that are scheduled for intense sequence analysis. Includes nomenclature, map location, gene products and their attributes, markers, phenotypes, and links to citations, sequences, variation details, maps, expression, homologs, protein domains and external databases. All entries follow NCBI's format for data collections. Content of Entrez Gene represents result of curation and automated integration of data from NCBI's Reference Sequence project (RefSeq), from collaborating model organism databases, and from many other databases available from NCBI. Records are assigned unique, stable and tracked integers as identifiers. Content is updated as new information becomes available.

Gene Ontology (tool)

RRID:SCR_002811

Computable knowledge regarding functions of genes and gene products. GO resources include biomedical ontologies that cover molecular domains of all life forms as well as extensive compilations of gene product annotations to these ontologies that provide largely species-neutral, comprehensive statements about what gene products do. Used to standardize representation of gene and gene product attributes across species and databases.

ArrayExpress (tool)

RRID:SCR_002964

International functional genomics data collection generated from microarray or next-generation sequencing (NGS) platforms. Repository of functional genomics data supporting publications. Provides gene expression data to the research community for reuse, where they can be queried and downloaded. Integrated with the Gene Expression Atlas and the sequence databases at the European Bioinformatics Institute. Contains a subset of curated and re-annotated Archive data which can be queried for individual gene expression under different biological conditions across experiments. Data are collected to MIAME and MINSEQE standards. Data are submitted by users or are imported directly from the NCBI Gene Expression Omnibus.

Reactome (tool)

RRID:SCR_003485

Collection of pathways and pathway annotations. The core unit of the Reactome data model is the reaction. Entities (nucleic acids, proteins, complexes and small molecules) participating in reactions form a network of biological interactions and are grouped into pathways (signaling, innate and acquired immune function, transcriptional regulation, translation, apoptosis and classical intermediary metabolism). Provides website to navigate pathway knowledge and a suite of data analysis tools to support the pathway-based analysis of complex experimental and computational data sets.

OMIM (tool)

RRID:SCR_006437

Online catalog of human genes and genetic disorders, for clinical features, phenotypes and genes. Collection of human genes and genetic phenotypes, focusing on relationship between phenotype and genotype. Referenced overviews in OMIM contain information on all known mendelian disorders and variety of related genes. It is updated daily, and entries contain copious links to other genetics resources.

InterPro (tool)

RRID:SCR_006695

Service providing functional analysis of proteins by classifying them into families and predicting domains and important sites. It combines protein signatures from a number of member databases into a single searchable resource, capitalizing on their individual strengths to produce a powerful integrated database and diagnostic tool. This integrated database of predictive protein signatures is used for the classification and automatic annotation of proteins and genomes. InterPro classifies sequences at superfamily, family and subfamily levels, predicting the occurrence of functional domains, repeats and important sites. InterPro adds in-depth annotation, including GO terms, to the protein signatures. The data can be accessed programmatically via web services. The member databases use a number of approaches:

* ProDom: provider of sequence clusters built from UniProtKB using PSI-BLAST.
* PROSITE patterns: provider of simple regular expressions.
* PROSITE and HAMAP profiles: providers of sequence matrices.
* PRINTS: provider of fingerprints, which are groups of aligned, un-weighted Position Specific Sequence Matrices (PSSMs).
* PANTHER, PIRSF, Pfam, SMART, TIGRFAMs, Gene3D and SUPERFAMILY: providers of hidden Markov models (HMMs).

Contributions are welcome. Users are encouraged to use the "Add your annotation" button on InterPro entry pages to suggest updated or improved annotation for individual InterPro entries.

HPA (tool)

RRID:SCR_006710

Public database with millions of high-resolution images showing the spatial distribution of proteins in different normal human tissues and cancer types, as well as different human cell lines. The data are released together with application-specific validation performed for each antibody, including immunohistochemistry, Western blot analysis and, for a large fraction, a protein array assay and immunofluorescence-based confocal microscopy. The database has been developed in a gene-centric manner with the inclusion of all human genes predicted from genome efforts. Search functionalities allow for complex queries regarding protein expression profiles, protein classes and chromosome location. The antibodies included have been analyzed using a standardized protocol in a single attempt, without further efforts to optimize the procedure. It therefore cannot be excluded that certain observed binding properties are due to technical rather than biological reasons, and that further optimization could result in a different outcome. Submission of antibodies: The Swedish Human Proteome Atlas (HPA) program invites submission of antibodies from both academic and commercial sources to be included in the human protein atlas. All antibodies will be validated by the HPA program using a standard procedure, and antibodies that are accepted will be used in the tissue-profiling program to generate high-resolution immunohistochemistry images representing a wide spectrum of normal tissues and cancer types.

IntAct (tool)

RRID:SCR_006944

Open source database system and analysis tools for molecular interaction data. All interactions are derived from literature curation or direct user submissions. Direct user submissions of molecular interaction data are encouraged, which may be deposited prior to publication in a peer-reviewed journal. The IntAct Database contains (Jun. 2014):

* 447368 interactions
* 33021 experiments
* 12698 publications
* 82745 interactors

IntAct provides a two-tiered view of the interaction data. The search interface allows the user to iteratively develop complex queries, exploiting the detailed annotation with hierarchical controlled vocabularies. Results are provided at any stage in a simplified, tabular view. Specialized views then allow "zooming in" on the full annotation of interactions, interactors and their properties. IntAct source code and data are freely available.

OBO (tool)

RRID:SCR_007083

A collaboration involving developers of science-based ontologies who are establishing a set of principles for ontology development with the goal of creating a suite of orthogonal interoperable reference ontologies in the biomedical domain. In addition to a listing of OBO ontologies, this site provides a statement of the OBO Foundry principles, discussion fora, technical infrastructure, and other services to facilitate ontology development. Feedback is welcome and participation encouraged.

Biological General Repository for Interaction Datasets (BioGRID) (tool)

RRID:SCR_007393

Curated protein-protein and genetic interaction repository of raw protein and genetic interactions from major model organism species, with data compiled through comprehensive curation efforts.

Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB PDB) (tool)

RRID:SCR_012820

Collection of structural data of biological macromolecules. Database of information about 3D structures of large biological molecules, including proteins and nucleic acids. Users can perform queries on data and analyze and visualize results.
