The advent of sexual reproduction and the evolution of a dedicated germline in multicellular organisms are critical landmarks in eukaryotic evolution. We report an ancient family of GCNA (germ cell nuclear antigen) proteins that arose in the earliest eukaryotes, and feature a rapidly evolving intrinsically disordered region (IDR). Phylogenetic analysis reveals that GCNA proteins emerged before the major eukaryotic lineages diverged; GCNA predates the origin of a dedicated germline by a billion years. Gcna gene expression is enriched in reproductive cells across eukarya - either just prior to or during meiosis in single-celled eukaryotes, and in stem cells and germ cells of diverse multicellular animals. Studies of Gcna-mutant C. elegans and mice indicate that GCNA has functioned in reproduction for at least 600 million years. Homology to IDR-containing proteins implicated in DNA damage repair suggests that GCNA proteins may protect the genomic integrity of cells carrying a heritable genome.
Pubmed ID: 27718356 RIS Download
Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.
THIS RESOURCE IS NO LONGER IN SERVICE. Documented on February 28,2023. This server takes a multiple-alignment file in either GCG''s MSF-format or Clustals ALN-format. Sponsors: This resource was supported by the Swiss EMBnet Node Server. Keywords: Server, Multiple-alignment,
View all literature mentionsThe Database of Protein Disorder (DisProt) is a curated database that provides information about proteins that lack fixed 3D structure in their putatively native states, either in their entirety or in part. Users can BLAST sequences, browse by protein name, or view by protein function and functional subclass.
View all literature mentionsCenter that produces knockout mice and carries out high-throughput phenotyping of each line in order to determine function of every gene in mouse genome. These mice will be preserved in repositories and made available to scientific community representing valuable resource for basic scientific research as well as generating new models for human diseases.
View all literature mentionslaboratory mouse with name C57BL/6N from MGI.
View all literature mentionsCaenorhabditis elegans with name C. elegans wild type (ancestral). from WB.
View all literature mentionsThis monoclonal targets Germ cell nuclear antigen 1; GCNA1
View all literature mentionsThis monoclonal targets Germ cell-specific antigen antibody [TRA98]
View all literature mentionsThis unknown targets isotype control
View all literature mentionsThis unknown targets Sheep Digoxigenin
View all literature mentionsThis unknown targets DNP
View all literature mentionsThis unknown targets IgG
View all literature mentionsThis monoclonal targets Penta-His
View all literature mentionsThis unknown targets Rat IgG, whole molecule
View all literature mentionsThis monoclonal targets GATA-4 (G-4)
View all literature mentionsMultiple sequence alignment method with reduced time and space complexity.Multiple sequence alignment with high accuracy and high throughput. Data analysis service for multiple sequence comparison by log- expectation.
View all literature mentionsCentral repository for collection of functional information on proteins, with accurate and consistent annotation. In addition to capturing core data mandatory for each UniProtKB entry (mainly, the amino acid sequence, protein name or description, taxonomic data and citation information), as much annotation information as possible is added. This includes widely accepted biological ontologies, classifications and cross-references, and experimental and computational data. The UniProt Knowledgebase consists of two sections, UniProtKB/Swiss-Prot and UniProtKB/TrEMBL. UniProtKB/Swiss-Prot (reviewed) is a high quality manually annotated and non-redundant protein sequence database which brings together experimental results, computed features, and scientific conclusions. UniProtKB/TrEMBL (unreviewed) contains protein sequences associated with computationally generated annotation and large-scale functional characterization that await full manual annotation. Users may browse by taxonomy, keyword, gene ontology, enzyme class or pathway.
View all literature mentionsComputational biology resource for investigating candidate functional sites in eukarytic proteins. Functional sites which fit to the description linear motif are currently specified as patterns using Regular Expression rules. To improve the predictive power, context-based rules and logical filters are being developed and applied to reduce the amount of false positives. The current version of the ELM server provides core functionality including filtering by cell compartment, phylogeny, globular domain clash (using the SMART/Pfam databases) and structure. In addition, both the known ELM instances and any positionally conserved matches in sequences similar to ELM instance sequences are identified and displayed (see ELM instance mapper). Although the ELM resource contains a large collection of functional site motifs, the current set of motifs is not exhaustive.
View all literature mentionsA Python-based environment of open-source software for mathematics, science, and engineering. The core packages of SciPy include: NumPy, a base N-dimensional array package; SciPy Library, a fundamental library for scientific computing; and IPython, an enhanced interactive console.
View all literature mentionsA database of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs). Users can analyze protein sequences for Pfam matches, view Pfam family annotation and alignments, see groups of related families, look at the domain organization of a protein sequence, find the domains on a PDB structure, and query Pfam by keywords. There are two components to Pfam: Pfam-A and Pfam-B. Pfam-A entries are high quality, manually curated families that may automatically generate a supplement using the ADDA database. These automatically generated entries are called Pfam-B. Although of lower quality, Pfam-B families can be useful for identifying functionally conserved regions when no Pfam-A entries are found. Pfam also generates higher-level groupings of related families, known as clans (collections of Pfam-A entries which are related by similarity of sequence, structure or profile-HMM).
View all literature mentionsAn information resource for peptidases (also termed proteases, proteinases and proteolytic enzymes) and the proteins that inhibit them. The MEROPS database uses an hierarchical, structure-based classification of the peptidases. In this, each peptidase is assigned to a Family on the basis of statistically significant similarities in amino acid sequence, and families that are thought to be homologous are grouped together in a Clan. There is a Summary page for each family and clan, and these have indexes. Each of the Summary pages offers links to supplementary pages. About 3000 individual peptidases and inhibitors are included in the database, and there is a Summary page describing each one. You can navigate to this by any of several routes. There are indexes of Name, MEROPS Identifier and source Organism on the menu bar. Each Summary page describes the classification and nomenclature of the peptidase or inhibitor, and provides links to supplementary pages showing sequence identifiers, the structure if known, literature references and more.
View all literature mentionsThe primary aim in developing HHpred was to provide biologists with a method for sequence database searching and structure prediction that is as easy to use as BLAST or PSI-BLAST and that is at the same time much more sensitive in finding remote homologs. In fact, HHpred''s sensitivity is competitive with the most powerful servers for structure prediction currently available. HHpred is the first server that is based on the pairwise comparison of profile hidden Markov models (HMMs). Whereas most conventional sequence search methods search sequence databases such as UniProt or the NR, HHpred searches alignment databases, like Pfam or SMART. This greatly simplifies the list of hits to a number of sequence families instead of a clutter of single sequences. All major publicly available profile and alignment databases are available through HHpred.
View all literature mentionsWeb server for flexible protein structure comparison. Structure alignment is formulated as the aligned fragment pairs chaining process allowing at most t twists, and the flexible structure alignment is transformed into a rigid structure alignment when t is forced to be 0.
View all literature mentionsDatabase of annotated three-dimensional comparative protein structure models generated by the fully automated homology-modelling pipeline SWISS-MODEL.
View all literature mentionsA database of three-dimensional protein models calculated by comparative modeling. ModBase is organized into datasets, which are either available to the public, to the academic community, or to specific users. 20 unique amidohydrolase and 41 unique enolase structures have been determined have been included in the database.
View all literature mentionsWeb server as integrated platform for automated protein structure and function prediction. Used for protein 3D structure prediction. Resource for automated protein structure prediction and structure-based function annotation.
View all literature mentionsWeb tool for discovery and visualization of differences in amino acid composition. Two samples of amino acid sequences serve as input and a bar chart composed of twenty data points is output.
View all literature mentionsWeb-based software used for the selection of best-fit models of protein evolution.
View all literature mentionsWeb phylogeny server based on the maximum-likelihood principle.
View all literature mentionsPortal providing access to all JGI genomic databases and analytical tools, sequencing projects and their status, search for and download assemblies and annotations of sequenced genomes, and interactively explore those genomes and compare them with other sequenced microbes, fungi, plants or metagenomes using specialized systems tailored to each particular class of organisms. The Department of Energy (DOE) Joint Genome Institute (JGI) is a national user facility with massive-scale DNA sequencing and analysis capabilities dedicated to advancing genomics for bioenergy and environmental applications. Beyond generating tens of trillions of DNA bases annually, the Institute develops and maintains data management systems and specialized analytical capabilities to manage and interpret complex genomic data sets, and to enable an expanding community of users around the world to analyze these data in different contexts over the web.
View all literature mentionsWeb search tool to find regions of similarity between biological sequences. Program compares nucleotide or protein sequences to sequence databases and calculates statistical significance. Used for identifying homologous sequences.
View all literature mentionsA tool for creating logos representing both sequence alignments and profile hidden Markov models. The interactive logos enable scrolling, zooming, and inspection of underlying values. Skylign can avoid sampling bias in sequence alignments by down-weighting redundant sequences and by combining observed counts with informed priors. It also simplifies the representation of gap parameters, and can optionally scale letter heights based on alternate calculations of the conservation of a position.
View all literature mentionsTool for searching sequence databases for homologs of protein sequences, and for making protein sequence alignments. It implements methods using probabilistic models called profile hidden Markov models (profile HMMs). Compared to BLAST, FASTA, and other sequence alignment and database search tools based on older scoring methodology, HMMER aims to be significantly more accurate and more able to detect remote homologs because of the strength of its underlying mathematical models. In the past, this strength came at significant computational expense, but in the new HMMER3 project, HMMER is now essentially as fast as BLAST.
View all literature mentionsA free program for multiple sequence alignment editing, visualisation and analysis that is available in two forms: a lightweight Java applet for use in web applications, and a powerful desktop application that employs web services for sequence alignment, secondary structure prediction and the retrieval of alignments, sequences, annotation and structures from public databases and any DAS 1.53 compliant sequence or annotation server. Use it to view and edit sequence alignments, analyse them with phylogenetic trees and principal components analysis (PCA) plots and explore molecular structures and annotation. Jalview has built in DNA, RNA and protein sequence and structure visualisation and analysis capabilities. It uses Jmol to view 3D structures, and VARNA to display RNA secondary structure.
View all literature mentionsA web server which recognizes tertiary structures from an amino acid sequence based on estimated pairwise energy content. Users can input SWISS-PROT/TrEMBL identifier or accession number, or paste the amino acid sequence.
View all literature mentionsCell line JM8 is a Embryonic stem cell with a species of origin Mus musculus (Mouse)
View all literature mentionsMus musculus with name ES Parental cell line E14Tg2a.4 from MMRRC.
View all literature mentionsRepository of mouse vectors, ES cells, mice, embryos, and sperm generated by NIH KOMP Mutagenesis Project. In addition, KOMP Repository offers services in support of KOMP products, including ES cell microinjection, vector cloning, post-insertional modification of cloned ES cells, cryopreservation, assisted reproduction techniques (IVF, ICSI) and mouse breeding, pathology services, phenotyping services, etc. KOMP Repository is final component of more than $50 million trans-NIH initiative to increase availability of genetically altered mice and related materials. The University of California, Davis (UC Davis) and Children''s Hospital Oakland Research Institute (CHORI) in Oakland, Calif., are collaborating to preserve, protect, and make available about 8,500 types of knockout mice and related products available to research community. Products are generated by two KOMP mutagenesis teams (CSD consortium and Regeneron Inc). All KOMP products generated by CSD consortium and Regeneron are available through KOMP Repository. Notice as of December 19, 2019: Materials from KOMP Repository have been deposited into MMRRC, including all mouse models and mouse embryonic stem cell lines. Eventually www.komp.org will be sunsetting, and IMSR will remove KOMP Repository listings, since they were double listed in MMRRC. MMRRC will contain the most accurate and up to date resource models.
View all literature mentionsSoftware analysis package for molecular biology community. Automatically copes with data in variety of formats and allows transparent retrieval of sequence data from web. Libraries are provided with package. Provides toolkit for creating bioinformatics applications or workflows. Provides set of sequence analysis programs. Provided programs cover areas such as sequence alignment, rapid database searching with sequence patterns, protein motif identification, nucleotide sequence pattern analysis, codon usage analysis for small genomes, rapid identification of sequence patterns in large scale sequence sets, and presentation tools for publication.
View all literature mentionsA a configurable software package for peptide and protein mass spectrometry analyses. It includes the SEQUEST search algorithm to identify separate proteins in complex mixtures, interactive navigation tools to filter and sort protein summaries, customized spectral plots, and chromatograms using the PEPMATCH and PEPMAP tools. This software also has batch processing capabilities to improve throughput by queuing up several files, and custom-build proprietary databases, index databases, and retrieve databases through a public server.
View all literature mentionsAllele Detail: Transgenic This is a legacy resource.
View all literature mentionsRepository of mouse vectors, ES cells, mice, embryos, and sperm generated by NIH KOMP Mutagenesis Project. In addition, KOMP Repository offers services in support of KOMP products, including ES cell microinjection, vector cloning, post-insertional modification of cloned ES cells, cryopreservation, assisted reproduction techniques (IVF, ICSI) and mouse breeding, pathology services, phenotyping services, etc. KOMP Repository is final component of more than $50 million trans-NIH initiative to increase availability of genetically altered mice and related materials. The University of California, Davis (UC Davis) and Children''s Hospital Oakland Research Institute (CHORI) in Oakland, Calif., are collaborating to preserve, protect, and make available about 8,500 types of knockout mice and related products available to research community. Products are generated by two KOMP mutagenesis teams (CSD consortium and Regeneron Inc). All KOMP products generated by CSD consortium and Regeneron are available through KOMP Repository. Notice as of December 19, 2019: Materials from KOMP Repository have been deposited into MMRRC, including all mouse models and mouse embryonic stem cell lines. Eventually www.komp.org will be sunsetting, and IMSR will remove KOMP Repository listings, since they were double listed in MMRRC. MMRRC will contain the most accurate and up to date resource models.
View all literature mentionsMus musculus with name ES Parental cell line E14Tg2a.4 from MMRRC.
View all literature mentionsThis monoclonal targets Penta-His
View all literature mentionsTool for searching sequence databases for homologs of protein sequences, and for making protein sequence alignments. It implements methods using probabilistic models called profile hidden Markov models (profile HMMs). Compared to BLAST, FASTA, and other sequence alignment and database search tools based on older scoring methodology, HMMER aims to be significantly more accurate and more able to detect remote homologs because of the strength of its underlying mathematical models. In the past, this strength came at significant computational expense, but in the new HMMER3 project, HMMER is now essentially as fast as BLAST.
View all literature mentionsA tool for creating logos representing both sequence alignments and profile hidden Markov models. The interactive logos enable scrolling, zooming, and inspection of underlying values. Skylign can avoid sampling bias in sequence alignments by down-weighting redundant sequences and by combining observed counts with informed priors. It also simplifies the representation of gap parameters, and can optionally scale letter heights based on alternate calculations of the conservation of a position.
View all literature mentionsMultiple sequence alignment method with reduced time and space complexity.Multiple sequence alignment with high accuracy and high throughput. Data analysis service for multiple sequence comparison by log- expectation.
View all literature mentionsA free program for multiple sequence alignment editing, visualisation and analysis that is available in two forms: a lightweight Java applet for use in web applications, and a powerful desktop application that employs web services for sequence alignment, secondary structure prediction and the retrieval of alignments, sequences, annotation and structures from public databases and any DAS 1.53 compliant sequence or annotation server. Use it to view and edit sequence alignments, analyse them with phylogenetic trees and principal components analysis (PCA) plots and explore molecular structures and annotation. Jalview has built in DNA, RNA and protein sequence and structure visualisation and analysis capabilities. It uses Jmol to view 3D structures, and VARNA to display RNA secondary structure.
View all literature mentionsThis monoclonal targets Penta-His
View all literature mentionsA web server which recognizes tertiary structures from an amino acid sequence based on estimated pairwise energy content. Users can input SWISS-PROT/TrEMBL identifier or accession number, or paste the amino acid sequence.
View all literature mentionsCell line JM8 is a Embryonic stem cell with a species of origin Mus musculus (Mouse)
View all literature mentionsThe primary aim in developing HHpred was to provide biologists with a method for sequence database searching and structure prediction that is as easy to use as BLAST or PSI-BLAST and that is at the same time much more sensitive in finding remote homologs. In fact, HHpred''s sensitivity is competitive with the most powerful servers for structure prediction currently available. HHpred is the first server that is based on the pairwise comparison of profile hidden Markov models (HMMs). Whereas most conventional sequence search methods search sequence databases such as UniProt or the NR, HHpred searches alignment databases, like Pfam or SMART. This greatly simplifies the list of hits to a number of sequence families instead of a clutter of single sequences. All major publicly available profile and alignment databases are available through HHpred.
View all literature mentionsWeb phylogeny server based on the maximum-likelihood principle.
View all literature mentionsThis unknown targets IgG
View all literature mentionsPortal providing access to all JGI genomic databases and analytical tools, sequencing projects and their status, search for and download assemblies and annotations of sequenced genomes, and interactively explore those genomes and compare them with other sequenced microbes, fungi, plants or metagenomes using specialized systems tailored to each particular class of organisms. The Department of Energy (DOE) Joint Genome Institute (JGI) is a national user facility with massive-scale DNA sequencing and analysis capabilities dedicated to advancing genomics for bioenergy and environmental applications. Beyond generating tens of trillions of DNA bases annually, the Institute develops and maintains data management systems and specialized analytical capabilities to manage and interpret complex genomic data sets, and to enable an expanding community of users around the world to analyze these data in different contexts over the web.
View all literature mentionsWeb search tool to find regions of similarity between biological sequences. Program compares nucleotide or protein sequences to sequence databases and calculates statistical significance. Used for identifying homologous sequences.
View all literature mentionsAn information resource for peptidases (also termed proteases, proteinases and proteolytic enzymes) and the proteins that inhibit them. The MEROPS database uses an hierarchical, structure-based classification of the peptidases. In this, each peptidase is assigned to a Family on the basis of statistically significant similarities in amino acid sequence, and families that are thought to be homologous are grouped together in a Clan. There is a Summary page for each family and clan, and these have indexes. Each of the Summary pages offers links to supplementary pages. About 3000 individual peptidases and inhibitors are included in the database, and there is a Summary page describing each one. You can navigate to this by any of several routes. There are indexes of Name, MEROPS Identifier and source Organism on the menu bar. Each Summary page describes the classification and nomenclature of the peptidase or inhibitor, and provides links to supplementary pages showing sequence identifiers, the structure if known, literature references and more.
View all literature mentionsWeb server for flexible protein structure comparison. Structure alignment is formulated as the aligned fragment pairs chaining process allowing at most t twists, and the flexible structure alignment is transformed into a rigid structure alignment when t is forced to be 0.
View all literature mentionsA database of three-dimensional protein models calculated by comparative modeling. ModBase is organized into datasets, which are either available to the public, to the academic community, or to specific users. 20 unique amidohydrolase and 41 unique enolase structures have been determined have been included in the database.
View all literature mentionsComputational biology resource for investigating candidate functional sites in eukarytic proteins. Functional sites which fit to the description linear motif are currently specified as patterns using Regular Expression rules. To improve the predictive power, context-based rules and logical filters are being developed and applied to reduce the amount of false positives. The current version of the ELM server provides core functionality including filtering by cell compartment, phylogeny, globular domain clash (using the SMART/Pfam databases) and structure. In addition, both the known ELM instances and any positionally conserved matches in sequences similar to ELM instance sequences are identified and displayed (see ELM instance mapper). Although the ELM resource contains a large collection of functional site motifs, the current set of motifs is not exhaustive.
View all literature mentionsA Python-based environment of open-source software for mathematics, science, and engineering. The core packages of SciPy include: NumPy, a base N-dimensional array package; SciPy Library, a fundamental library for scientific computing; and IPython, an enhanced interactive console.
View all literature mentionsA database of protein families, each represented by multiple sequence alignments and hidden Markov models (HMMs). Users can analyze protein sequences for Pfam matches, view Pfam family annotation and alignments, see groups of related families, look at the domain organization of a protein sequence, find the domains on a PDB structure, and query Pfam by keywords. There are two components to Pfam: Pfam-A and Pfam-B. Pfam-A entries are high quality, manually curated families that may automatically generate a supplement using the ADDA database. These automatically generated entries are called Pfam-B. Although of lower quality, Pfam-B families can be useful for identifying functionally conserved regions when no Pfam-A entries are found. Pfam also generates higher-level groupings of related families, known as clans (collections of Pfam-A entries which are related by similarity of sequence, structure or profile-HMM).
View all literature mentionsWeb-based software used for the selection of best-fit models of protein evolution.
View all literature mentionsThis unknown targets DNP
View all literature mentionsMultiple sequence alignment method with reduced time and space complexity.Multiple sequence alignment with high accuracy and high throughput. Data analysis service for multiple sequence comparison by log- expectation.
View all literature mentionsCentral repository for collection of functional information on proteins, with accurate and consistent annotation. In addition to capturing core data mandatory for each UniProtKB entry (mainly, the amino acid sequence, protein name or description, taxonomic data and citation information), as much annotation information as possible is added. This includes widely accepted biological ontologies, classifications and cross-references, and experimental and computational data. The UniProt Knowledgebase consists of two sections, UniProtKB/Swiss-Prot and UniProtKB/TrEMBL. UniProtKB/Swiss-Prot (reviewed) is a high quality manually annotated and non-redundant protein sequence database which brings together experimental results, computed features, and scientific conclusions. UniProtKB/TrEMBL (unreviewed) contains protein sequences associated with computationally generated annotation and large-scale functional characterization that await full manual annotation. Users may browse by taxonomy, keyword, gene ontology, enzyme class or pathway.
View all literature mentionsThis monoclonal targets Germ cell-specific antigen antibody [TRA98]
View all literature mentionsThis monoclonal targets Germ cell nuclear antigen 1; GCNA1
View all literature mentionsDatabase of annotated three-dimensional comparative protein structure models generated by the fully automated homology-modelling pipeline SWISS-MODEL.
View all literature mentionsWeb server as integrated platform for automated protein structure and function prediction. Used for protein 3D structure prediction. Resource for automated protein structure prediction and structure-based function annotation.
View all literature mentionsWeb tool for discovery and visualization of differences in amino acid composition. Two samples of amino acid sequences serve as input and a bar chart composed of twenty data points is output.
View all literature mentionsThis unknown targets Sheep Digoxigenin
View all literature mentionsSoftware analysis package for molecular biology community. Automatically copes with data in variety of formats and allows transparent retrieval of sequence data from web. Libraries are provided with package. Provides toolkit for creating bioinformatics applications or workflows. Provides set of sequence analysis programs. Provided programs cover areas such as sequence alignment, rapid database searching with sequence patterns, protein motif identification, nucleotide sequence pattern analysis, codon usage analysis for small genomes, rapid identification of sequence patterns in large scale sequence sets, and presentation tools for publication.
View all literature mentionsCentral repository for collection of functional information on proteins, with accurate and consistent annotation. In addition to capturing core data mandatory for each UniProtKB entry (mainly, the amino acid sequence, protein name or description, taxonomic data and citation information), as much annotation information as possible is added. This includes widely accepted biological ontologies, classifications and cross-references, and experimental and computational data. The UniProt Knowledgebase consists of two sections, UniProtKB/Swiss-Prot and UniProtKB/TrEMBL. UniProtKB/Swiss-Prot (reviewed) is a high quality manually annotated and non-redundant protein sequence database which brings together experimental results, computed features, and scientific conclusions. UniProtKB/TrEMBL (unreviewed) contains protein sequences associated with computationally generated annotation and large-scale functional characterization that await full manual annotation. Users may browse by taxonomy, keyword, gene ontology, enzyme class or pathway.
View all literature mentionsA a configurable software package for peptide and protein mass spectrometry analyses. It includes the SEQUEST search algorithm to identify separate proteins in complex mixtures, interactive navigation tools to filter and sort protein summaries, customized spectral plots, and chromatograms using the PEPMATCH and PEPMAP tools. This software also has batch processing capabilities to improve throughput by queuing up several files, and custom-build proprietary databases, index databases, and retrieve databases through a public server.
View all literature mentionsThis unknown targets Rat IgG, whole molecule
View all literature mentionsThis monoclonal targets Germ cell-specific antigen antibody [TRA98]
View all literature mentionsThis unknown targets isotype control
View all literature mentionsThis monoclonal targets Germ cell nuclear antigen 1; GCNA1
View all literature mentionsThis monoclonal targets GATA-4 (G-4)
View all literature mentionsAllele Detail: Transgenic This is a legacy resource.
View all literature mentions