• Register
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.


Leaving Community

Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.


InParanoid 7: new algorithms and tools for eukaryotic orthology analysis.

The InParanoid project gathers proteomes of completely sequenced eukaryotic species plus Escherichia coli and calculates pairwise ortholog relationships among them. The new release 7.0 of the database has grown by an order of magnitude over the previous version and now includes 100 species and their collective 1.3 million proteins organized into 42.7 million pairwise ortholog groups. The InParanoid algorithm itself has been revised and is now both more specific and sensitive. Based on results from our recent benchmarking of low-complexity filters in homology assignment, a two-pass BLAST approach was developed that makes use of high-precision compositional score matrix adjustment, but avoids the alignment truncation that sometimes follows. We have also updated the InParanoid web site (http://InParanoid.sbc.su.se). Several features have been added, the response times have been improved and the site now sports a new, clearer look. As the number of ortholog databases has grown, it has become difficult to compare among these resources due to a lack of standardized source data and incompatible representations of ortholog relationships. To facilitate data exchange and comparisons among ortholog databases, we have developed and are making available two XML schemas: SeqXML for the input sequences and OrthoXML for the output ortholog clusters.

Pubmed ID: 19892828


  • Ostlund G
  • Schmitt T
  • Forslund K
  • K√∂stler T
  • Messina DN
  • Roopra S
  • Frings O
  • Sonnhammer EL


Nucleic acids research

Publication Data

January 22, 2010

Associated Grants


Mesh Terms

  • Algorithms
  • Animals
  • Cluster Analysis
  • Computational Biology
  • Databases, Genetic
  • Databases, Nucleic Acid
  • Escherichia coli
  • Eukaryotic Cells
  • Genome, Bacterial
  • Humans
  • Information Storage and Retrieval
  • Internet
  • Protein Structure, Tertiary
  • Proteins
  • Proteomics
  • Software