A Recent Whole-Genome Duplication Divides Populations of a Globally Distributed Microsporidian.

Molecular biology and evolution | 2016

The Microsporidia are a major group of intracellular fungi and important parasites of animals including insects, fish, and immunocompromised humans. Microsporidian genomes have undergone extreme reductive evolution but there are major differences in genome size and structure within the group: some are prokaryote-like in size and organisation (<3 Mb of gene-dense sequence) while others have more typically eukaryotic genome architectures. To gain fine-scale, population-level insight into the evolutionary dynamics of these tiny eukaryotic genomes, we performed the broadest microsporidian population genomic study to date, sequencing geographically isolated strains of Spraguea, a marine microsporidian infecting goosefish worldwide. Our analysis revealed that population structure across the Atlantic Ocean is associated with a conserved difference in ploidy, with American and Canadian isolates sharing an ancestral whole genome duplication that was followed by widespread pseudogenisation and sorting-out of paralogue pairs. While past analyses have suggested de novo gene formation of microsporidian-specific genes, we found evidence for the origin of new genes from noncoding sequence since the divergence of these populations. Some of these genes experience selective constraint, suggesting the evolution of new functions and local host adaptation. Combining our data with published microsporidian genomes, we show that nucleotide composition across the phylum is shaped by a mutational bias favoring A and T nucleotides, which is opposed by an evolutionary force favoring an increase in genomic GC content. This study reveals ongoing dramatic reorganization of genome structure and the evolution of new gene functions in modern microsporidians despite extensive genomic streamlining in their common ancestor.

PubMed ID: 27189558

Associated grants

  • Agency: European Research Council, International
    Id: 268701
  • Agency: Wellcome Trust, United Kingdom

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


R Project for Statistical Computing (tool)

RRID:SCR_001905

Software environment and programming language for statistical computing and graphics. R is an integrated suite of software facilities for data manipulation, calculation, and graphical display. It can be extended via packages: some are supplied with the R distribution and more are available through the CRAN family of sites. It compiles and runs on a wide variety of UNIX platforms, Windows, and macOS.

Stanford University; Stanford; California (tool)

RRID:SCR_011538

Private, nonprofit university in Stanford, California, USA for research and undergraduate and graduate studies. Known for its academic strength, wealth, proximity to Silicon Valley, and ranking as one of the world's top universities. It is particularly noted for its entrepreneurship and is one of the most successful universities in attracting funding for start-ups.

MEME Suite - Motif-based sequence analysis tools (tool)

RRID:SCR_001783

Suite of motif-based sequence analysis tools to discover motifs using MEME, DREME (DNA only) or GLAM2 on groups of related DNA or protein sequences; search sequence databases with motifs using MAST, FIMO, MCAST or GLAM2SCAN; compare a motif to all motifs in a database of motifs; associate motifs with Gene Ontology terms via their putative target genes, and analyze motif enrichment using SpaMo or CentriMo. Source code, binaries and a web server are freely available for noncommercial use.

SAMTOOLS (tool)

RRID:SCR_002105

The original SAMTOOLS package has been split into three separate repositories: Samtools, BCFtools, and HTSlib. Samtools manipulates next-generation sequencing data and is used for reading, writing, editing, indexing, and viewing nucleotide alignments in SAM, BAM, and CRAM formats. BCFtools is used for reading and writing BCF2, VCF, and gVCF files and for calling, filtering, and summarising SNP and short indel sequence variants. HTSlib is used for reading and writing high-throughput sequencing data.

Cytoscape (tool)

RRID:SCR_003032

Software platform for complex network analysis and visualization. Used for visualization of molecular interaction networks and biological pathways and integrating these networks with annotations, gene expression profiles and other state data.

MatPlotLib (tool)

RRID:SCR_008624

Python 2D plotting library that produces publication-quality figures in a variety of hardcopy formats and interactive environments across platforms. Used in Python scripts, web application servers, and six graphical user interface toolkits. Used to generate plots, histograms, power spectra, bar charts, error charts, and scatter plots.

GENEPOP (tool)

RRID:SCR_009194

Population genetic data analysis software package. Used to perform exact tests for Hardy-Weinberg equilibrium, for population differentiation, and for genotypic disequilibrium among pairs of loci. Computes estimates of F-statistics, null allele frequencies, and allele size-based statistics for microsatellites, among others, and performs analyses of isolation by distance from pairwise comparisons of individuals or population samples.

Prodigal (tool)

RRID:SCR_011936

Software tool for protein coding gene prediction for prokaryotic genomes.
