Preparing your results

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

The subsystems approach to genome annotation and its use in the project to annotate 1000 genomes.

Nucleic acids research | Oct 10, 2005

The release of the 1000th complete microbial genome will occur in the next two to three years. In anticipation of this milestone, the Fellowship for Interpretation of Genomes (FIG) launched the Project to Annotate 1000 Genomes. The project is built around the principle that the key to improved accuracy in high-throughput annotation technology is to have experts annotate single subsystems over the complete collection of genomes, rather than having an annotation expert attempt to annotate all of the genes in a single genome. Using the subsystems approach, all of the genes implementing the subsystem are analyzed by an expert in that subsystem. An annotation environment was created where populated subsystems are curated and projected to new genomes. A portable notion of a populated subsystem was defined, and tools developed for exchanging and curating these objects. Tools were also developed to resolve conflicts between populated subsystems. The SEED is the first annotation environment that supports this model of annotation. Here, we describe the subsystem approach, and offer the first release of our growing library of populated subsystems. The initial release of data includes 180 177 distinct proteins with 2133 distinct functional roles. This data comes from 173 subsystems and 383 different organisms.

Pubmed ID: 16214803 RIS Download

Mesh terms: Acyl Coenzyme A | Coenzyme A | Computational Biology | Genome, Archaeal | Genome, Bacterial | Genomics | Internet | Leucine | Ribosomal Proteins | Software | Terminology as Topic | Vocabulary, Controlled

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


GO

A community-based bioinformatics resource consisting of three structured controlled vocabularies (ontologies) for the annotation of gene products with respect to their molecular function, cellular component, and biological role in a species-independent manner. This initiative to standardize the representation of gene and gene product attributes across species and databases is an effort to address the need for consistent descriptions of gene products in different databases. The Gene Ontology project encourages input from the community into both the content of the GO and annotation using GO. There are three separate aspects to this effort: first, they write and maintain the ontologies themselves; second, they make cross-links between the ontologies and the genes and gene products in the collaborating databases; and third, they develop tools that facilitate the creation, maintenance and use of ontologies. The controlled vocabularies are structured so that users can query them at different levels: for example, uers can use GO to find all the gene products in the mouse genome that are involved in signal transduction, or users can zoom in on all the receptor tyrosine kinases. This structure also allows annotators to assign properties to gene products at different levels, depending on how much is known about a gene product.

tool

View all literature mentions