Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.

Molecular systems biology | Oct 11, 2011

Multiple sequence alignments are fundamental to many sequence analysis methods. Most alignments are computed using the progressive alignment heuristic. These methods are starting to become a bottleneck in some analysis pipelines when faced with data sets of the size of many thousands of sequences. Some methods allow computation of larger data sets while sacrificing quality, and others produce high-quality alignments, but scale badly with the number of sequences. In this paper, we describe a new program called Clustal Omega, which can align virtually any number of protein sequences quickly and that delivers accurate alignments. The accuracy of the package on smaller test cases is similar to that of the high-quality aligners. On larger data sets, Clustal Omega outperforms other packages in terms of execution time and quality. Clustal Omega also has powerful features for adding sequences to and exploiting information in existing alignments, making use of the vast amount of precomputed information in public databases like Pfam.

Pubmed ID: 21988835 RIS Download

Mesh terms: Algorithms | Amino Acid Sequence | Base Sequence | Data Mining | Databases, Factual | Molecular Sequence Data | Proteins | Sequence Alignment | Sequence Analysis, Protein | Software | Systems Biology

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


Eddy Lab Software

Software library containing tools for statistical manipulations of data. Tools include profile hidden Markov models for biological sequence analysis, RNA structure analysis, and a prototype noncoding RNA genefinder.

tool

View all literature mentions

SourceForge

An open source community of user-contributed and user-built software tools and resources.

tool

View all literature mentions

European Bioinformatics Institute

A non-profit academic organization for research and services in bioinformatics that provides freely available data from life science experiments, performs basic research in computational biology, and offers an extensive user training programme, supporting researchers in academia and industry. The Institute manages databases of biological data including nucleic acid, protein sequences, and macromolecular structures.

tool

View all literature mentions