• Register
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.


Leaving Community

Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.


Threading a database of protein cores.

We present an analysis of 10 blind predictions prepared for a recent conference, "Critical Assessment of Techniques for Protein Structure Prediction." The sequences of these proteins are not detectably similar to those of any protein in the structure database then available, but we attempted, by a threading method, to recognize similarity to known domain folds. Four of the 10 proteins, as we subsequently learned, do indeed show significant similarity to then-known structures. For 2 of these proteins the predictions were accurate, in the sense that a similar structure was at or near the top of the list of threading scores, and the threading alignment agreed well with the corresponding structural alignment. For the best predicted model mean alignment error relative to the optimal structural alignment was 2.7 residues, arising entirely from small "register shifts" of strands or helices. In the analysis we attempt to identify factors responsible for these successes and failures. Since our threading method does not use gap penalties, we may readily distinguish between errors arising from our prior definition of the "cores" of known structures and errors arising from inherent limitations in the threading potential. It would appear from the results that successful substructure recognition depends most critically on accurate definition of the "fold" of a database protein. This definition must correctly delineate substructures that are, and are not, likely to be conserved during protein evolution.

Pubmed ID: 8710828


  • Madej T
  • Gibrat JF
  • Bryant SH



Publication Data

November 12, 1995

Associated Grants


Mesh Terms

  • Amino Acid Sequence
  • Bacterial Proteins
  • Computer Graphics
  • Computer Simulation
  • DNA-Binding Proteins
  • Databases, Factual
  • Drosophila Proteins
  • Glycoside Hydrolases
  • Models, Molecular
  • Protein Conformation
  • Protein Folding
  • Proteins
  • Pyruvate, Orthophosphate Dikinase
  • RNA-Binding Proteins
  • Sequence Alignment
  • Subtilisins
  • beta-Galactosidase