NCBI Reference Sequences: current status, policy and new initiatives.

NCBI's Reference Sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) is a curated non-redundant collection of sequences representing genomes, transcripts and proteins. RefSeq records integrate information from multiple sources and represent a current description of the sequence, the gene and sequence features. The database includes over 5300 organisms spanning prokaryotes, eukaryotes and viruses, with records for more than 5.5 × 10^6 proteins (RefSeq release 30). Feature annotation is applied by a combination of curation, collaboration, propagation from other sources and computation. We report here on the recent growth of the database, recent changes to feature annotations and record types for eukaryotic (primarily vertebrate) species and policies regarding species inclusion and genome annotation. In addition, we introduce RefSeqGene, a new initiative to support reporting variation data on a stable genomic coordinate system.

PubMed ID: 18927115

Authors

  • Pruitt KD
  • Tatusova T
  • Klimke W
  • Maglott DR

Journal

Nucleic Acids Research

Publication Date

January 16, 2009

Associated Grants

None

MeSH Terms

  • Animals
  • Databases, Genetic
  • Exons
  • Genomics
  • Humans
  • Mice
  • Proteins
  • Pseudogenes
  • RNA, Untranslated
  • Reference Standards
  • Sequence Analysis