PIRSF: family classification system at the Protein Information Resource.

The Protein Information Resource (PIR) is an integrated public resource of protein informatics. To facilitate the sensible propagation and standardization of protein annotation and the systematic detection of annotation errors, PIR has extended its superfamily concept and developed the SuperFamily (PIRSF) classification system. Based on the evolutionary relationships of whole proteins, this classification system allows annotation of both specific biological and generic biochemical functions. The system adopts a network structure for protein classification from superfamily to subfamily levels. Protein family members are homologous (sharing common ancestry) and homeomorphic (sharing full-length sequence similarity with common domain architecture). The PIRSF database consists of two data sets, preliminary clusters and curated families. The curated families include family name, protein membership, parent-child relationship, domain architecture, and optional description and bibliography. PIRSF is accessible from the website at http://pir.georgetown.edu/pirsf/ for report retrieval and sequence classification. The report presents family annotation, membership statistics, cross-references to other databases, graphical display of domain architecture, and links to multiple sequence alignments and phylogenetic trees for curated families. PIRSF can be utilized to analyze phylogenetic profiles, to reveal functional convergence and divergence, and to identify interesting relationships between homeomorphic families, domains and structural classes.

PubMed ID: 14681371


  • Wu CH
  • Nikolskaya A
  • Huang H
  • Yeh LS
  • Natale DA
  • Vinayaka CR
  • Hu ZZ
  • Mazumder R
  • Kumar S
  • Kourtesis P
  • Ledley RS
  • Suzek BE
  • Arminski L
  • Chen Y
  • Zhang J
  • Cardenas JL
  • Chung S
  • Castro-Alvear J
  • Dinkov G
  • Barker WC


Nucleic Acids Research

Publication Data

January 1, 2004

Associated Grants

  • Agency: NHGRI NIH HHS, Id: U01 HG02712

MeSH Terms

  • Amino Acid Motifs
  • Animals
  • Computational Biology
  • Databases, Protein
  • Evolution, Molecular
  • Humans
  • Information Storage and Retrieval
  • Internet
  • Protein Structure, Tertiary
  • Proteins