Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Publication

Plastid proteome prediction for diatoms and other algae with secondary plastids of the red lineage.

Ansgar Gruber | Gabrielle Rocap | Peter G Kroth | E Virginia Armbrust | Thomas Mock

The Plant journal : for cell and molecular biology | 2015

The plastids of ecologically and economically important algae from phyla such as stramenopiles, dinoflagellates and cryptophytes were acquired via a secondary endosymbiosis and are surrounded by three or four membranes. Nuclear-encoded plastid-localized proteins contain N-terminal bipartite targeting peptides with the conserved amino acid sequence motif 'ASAFAP'. Here we identify the plastid proteomes of two diatoms, Thalassiosira pseudonana and Phaeodactylum tricornutum, using a customized prediction tool (ASAFind) that identifies nuclear-encoded plastid proteins in algae with secondary plastids of the red lineage based on the output of SignalP and the identification of conserved 'ASAFAP' motifs and transit peptides. We tested ASAFind against a large reference dataset of diatom proteins with experimentally confirmed subcellular localization and found that the tool accurately identified plastid-localized proteins with both high sensitivity and high specificity. To identify nucleus-encoded plastid proteins of T. pseudonana and P. tricornutum we generated optimized sets of gene models for both whole genomes, to increase the percentage of full-length proteins compared with previous assembly model sets. ASAFind applied to these optimized sets revealed that about 8% of the proteins encoded in their nuclear genomes were predicted to be plastid localized and therefore represent the putative plastid proteomes of these algae.

Pubmed ID: 25438865 RIS Download

Research resources used in this publication

None found

Additional research tools detected in this publication

Antibodies used in this publication

None found

Associated grants

None

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.

DOE Joint Genome Institute (tool)

RRID:SCR_003045

Institute to advance genomics in support of the DOE missions related to clean energy generation and environmental characterization and cleanup. Supported by the DOE Office of Science, the DOE JGI unites the expertise at Lawrence Berkeley National Laboratory, Lawrence Livermore National Laboratory, and the HudsonAlpha Institute for Biotechnology. The facility provides integrated high-throughput sequencing and computational analysis that enable systems-based scientific approaches to these challenges.

View all literature mentions

JGI Genome Portal (tool)

RRID:SCR_004706

Portal providing access to all JGI genomic databases and analytical tools, sequencing projects and their status, search for and download assemblies and annotations of sequenced genomes, and interactively explore those genomes and compare them with other sequenced microbes, fungi, plants or metagenomes using specialized systems tailored to each particular class of organisms. The Department of Energy (DOE) Joint Genome Institute (JGI) is a national user facility with massive-scale DNA sequencing and analysis capabilities dedicated to advancing genomics for bioenergy and environmental applications. Beyond generating tens of trillions of DNA bases annually, the Institute develops and maintains data management systems and specialized analytical capabilities to manage and interpret complex genomic data sets, and to enable an expanding community of users around the world to analyze these data in different contexts over the web.

View all literature mentions

WEBLOGO (tool)

RRID:SCR_010236

Web application to generate sequence logos, graphical representations of patterns within multiple sequence alignment. Designed to make generation of sequence logos easy. Sequence logo generator.

View all literature mentions

Biopython (tool)

RRID:SCR_007173

Biopython is a set of freely available tools for biological computation written in Python by an international team of developers. It is a distributed collaborative effort to develop Python libraries and applications which address the needs of current and future work in bioinformatics. The source code is made available under the Biopython License, which is extremely liberal and compatible with almost every license in the world. It works along with the Open Bioinformatics Foundation, who generously host it''s website, bug tracker, and mailing lists. Sponsor: This resource is supported by the Open Bioinformatics Foundation. Keywords: Tool, Software, Python, Biological, Computation, Bioinformatics,

View all literature mentions

BioEdit (tool)

RRID:SCR_007361

Software tool as biological sequence alignment editor written for Windows 95/98/NT/2000/XP/7 and sequence analysis program. Provides sequence manipulation and analysis options and links to external analysis programs to view and manipulate sequences with simple point and click operations.

View all literature mentions

TBLASTN (tool)

RRID:SCR_011822

Tool to search translated nucleotide databases using a protein query.

View all literature mentions

SignalP (tool)

RRID:SCR_015644

Web application for prediction of the presence and location of signal peptide cleavage sites in amino acid sequences from different organisms. The method incorporates a prediction of cleavage sites and a signal peptide/non-signal peptide prediction based on a combination of several artificial neural networks.

View all literature mentions

JGI Genome Portal (tool)

RRID:SCR_002383

View all literature mentions

About

The SciCrunch Infrastructure was developed as a cooperative data platform to be used by diverse communities in making data more FAIR.

Contact Us

FAIR Data Informatics Lab

University of California, San Diego

9500 Gilman Drive, Mail Code 0608

La Jolla, CA 92093-0608

United States

info

scicrunch.org

About SciCrunch | Privacy Policy | Terms of Service

Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

Plastid proteome prediction for diatoms and other algae with secondary plastids of the red lineage.

Research resources used in this publication

Additional research tools detected in this publication

Antibodies used in this publication

Associated grants

This is a list of tools and resources that we have found mentioned in this publication.