Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Interactive exploration of integrated biological datasets using context-sensitive workflows.

Frontiers in genetics | 2014

Network inference utilizes experimental high-throughput data for the reconstruction of molecular interaction networks where new relationships between the network entities can be predicted. Despite the increasing amount of experimental data, the parameters of each modeling technique cannot be optimized based on the experimental data alone, but needs to be qualitatively assessed if the components of the resulting network describe the experimental setting. Candidate list prioritization and validation builds upon data integration and data visualization. The application of tools supporting this procedure is limited to the exploration of smaller information networks because the display and interpretation of large amounts of information is challenging regarding the computational effort and the users' experience. The Ondex software framework was extended with customizable context-sensitive menus which allow additional integration and data analysis options for a selected set of candidates during interactive data exploration. We provide new functionalities for on-the-fly data integration using InterProScan, PubMed Central literature search, and sequence-based homology search. We applied the Ondex system to the integration of publicly available data for Aspergillus nidulans and analyzed transcriptome data. We demonstrate the advantages of our approach by proposing new hypotheses for the functional annotation of specific genes of differentially expressed fungal gene clusters. Our extension of the Ondex framework makes it possible to overcome the separation between data integration and interactive analysis. More specifically, computationally demanding calculations can be performed on selected sub-networks without losing any information from the whole network. Furthermore, our extensions allow for direct access to online biological databases which helps to keep the integrated information up-to-date.

Pubmed ID: 24600467 RIS Download

Associated grants

  • Agency: Biotechnology and Biological Sciences Research Council, United Kingdom
    Id: BB/F006012/1
  • Agency: Biotechnology and Biological Sciences Research Council, United Kingdom
    Id: BB/F006039/1
  • Agency: Biotechnology and Biological Sciences Research Council, United Kingdom
    Id: BB/G013969/1
  • Agency: Biotechnology and Biological Sciences Research Council, United Kingdom
    Id: BB/G014159/1

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


ASPGD (tool)

RRID:SCR_002047

Database of genetic and molecular biological information about the filamentous fungi of the genus Aspergillus including information about genes and proteins of Aspergillus nidulans and Aspergillus fumigatus; descriptions and classifications of their biological roles, molecular functions, and subcellular localizations; gene, protein, and chromosome sequence information; tools for analysis and comparison of sequences; and links to literature information; as well as a multispecies comparative genomics browser tool (Sybil) for exploration of orthology and synteny across multiple sequenced Sgenus species. Also available are Gene Ontology (GO) and community resources. Based on the Candida Genome Database, the Aspergillus Genome Database is a resource for genomic sequence data and gene and protein information for Aspergilli. Among its many species, the genus contains an excellent model organism (A. nidulans, or its teleomorph Emericella nidulans), an important pathogen of the immunocompromised (A. fumigatus), an agriculturally important toxin producer (A. flavus), and two species used in industrial processes (A. niger and A. oryzae). Search options allow you to: *Search AspGD database using keywords. *Find chromosomal features that match specific properties or annotations. *Find AspGD web pages using keywords located on the page. *Find information on one gene from many databases. *Search for keywords related to a phenotype (e.g., conidiation), an allele (such as veA1), or an experimental condition (e.g., light). Analysis and Tools allow you to: *Find similarities between a sequence of interest and Aspergillus DNA or protein sequences. *Display and analyze an Aspergillus sequence (or other sequence) in many ways. *Navigate the chromosomes set. View nucleotide and protein sequence. *Find short DNA/protein sequence matches in Aspergillus. *Design sequencing and PCR primers for Aspergillus or other input sequences. *Display the restriction map for a Aspergillus or other input sequence. *Find similarities between a sequence of interest and fungal nucleotide or protein sequences. AspGD welcomes data submissions.

View all literature mentions

Gene Ontology (tool)

RRID:SCR_002811

Computable knowledge regarding functions of genes and gene products. GO resources include biomedical ontologies that cover molecular domains of all life forms as well as extensive compilations of gene product annotations to these ontologies that provide largely species-neutral, comprehensive statements about what gene products do. Used to standardize representation of gene and gene product attributes across species and databases.

View all literature mentions

Gene Expression Omnibus (GEO) (tool)

RRID:SCR_007303

Functional genomics data repository supporting MIAME-compliant data submissions. Includes microarray-based experiments measuring the abundance of mRNA, genomic DNA, and protein molecules, as well as non-array-based technologies such as serial analysis of gene expression (SAGE) and mass spectrometry proteomic technology. Array- and sequence-based data are accepted. Collection of curated gene expression DataSets, as well as original Series and Platform records. The database can be searched using keywords, organism, DataSet type and authors. DataSet records contain additional resources including cluster tools and differential expression queries.

View all literature mentions

Kyoto Encyclopedia of Genes and Genomes Expression Database (tool)

RRID:SCR_001120

Database for mapping gene expression profiles to pathways and genomes. Repository of microarray gene expression profile data for Synechocystis PCC6803 (syn), Bacillus subtilis (bsu), Escherichia coli W3110 (ecj), Anabaena PCC7120 (ana), and other species contributed by the Japanese research community.

View all literature mentions

UniProtKB (tool)

RRID:SCR_004426

Central repository for collection of functional information on proteins, with accurate and consistent annotation. In addition to capturing core data mandatory for each UniProtKB entry (mainly, the amino acid sequence, protein name or description, taxonomic data and citation information), as much annotation information as possible is added. This includes widely accepted biological ontologies, classifications and cross-references, and experimental and computational data. The UniProt Knowledgebase consists of two sections, UniProtKB/Swiss-Prot and UniProtKB/TrEMBL. UniProtKB/Swiss-Prot (reviewed) is a high quality manually annotated and non-redundant protein sequence database which brings together experimental results, computed features, and scientific conclusions. UniProtKB/TrEMBL (unreviewed) contains protein sequences associated with computationally generated annotation and large-scale functional characterization that await full manual annotation. Users may browse by taxonomy, keyword, gene ontology, enzyme class or pathway.

View all literature mentions

LIMMA (tool)

RRID:SCR_010943

Software package for the analysis of gene expression microarray data, especially the use of linear models for analyzing designed experiments and the assessment of differential expression.

View all literature mentions

biomaRt (tool)

RRID:SCR_019214

Software package that integrates BioMart data resources with data analysis software in Bioconductor. Can annotate range of gene or gene product identifiers including Entrez Gene and Affymetrix probe identifiers with information such as gene symbol, chromosomal coordinates, Gene Ontology and OMIM annotation. Enables retrieval of genomic sequences and single nucleotide polymorphism information, which can be used in data analysis.

View all literature mentions