The mannose-6-phosphate isomerase (Mpi) locus in Semibalanus balanoides has been studied as a candidate gene for balancing selection for more than two decades. Previous work has shown that Mpi allozyme genotypes (fast and slow) have different frequencies across Atlantic intertidal zones due to selection on postsettlement survival (i.e., allele zonation). We present the complete gene sequence of the Mpi locus and quantify nucleotide polymorphism in S. balanoides, as well as divergence to its sister taxon Semibalanus cariosus We show that the slow allozyme contains a derived charge-altering amino acid polymorphism, and both allozyme classes correspond to two haplogroups with multiple internal haplotypes. The locus shows several footprints of balancing selection around the fast/slow site: an enrichment of positive Tajima's D for nonsynonymous mutations, an excess of polymorphism, and a spike in the levels of silent polymorphism relative to silent divergence, as well as a site frequency spectrum enriched for midfrequency mutations. We observe other departures from neutrality across the locus in both coding and noncoding regions. These include a nonsynonymous trans-species polymorphism and a recent mutation under selection within the fast haplogroup. The latter suggests ongoing allelic replacement of functionally relevant amino acid variants. Moreover, predicted models of Mpi protein structure provide insight into the functional significance of the putatively selected amino acid polymorphisms. While footprints of selection are widespread across the range of S. balanoides, our data show that intertidal zonation patterns are variable across both spatial and temporal scales. These data provide further evidence for heterogeneous selection on Mpi.
Pubmed ID: 32098846 RIS Download
Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.
Collection of curated, non-redundant genomic DNA, transcript RNA, and protein sequences produced by NCBI. Provides a reference for genome annotation, gene identification and characterization, mutation and polymorphism analysis, expression studies, and comparative analyses. Accessed through the Nucleotide and Protein databases.
View all literature mentionsA portal to biomedical and genomic information. NCBI creates public databases, conducts research in computational biology, develops software tools for analyzing genome data, and disseminates biomedical information for the better understanding of molecular processes affecting human health and disease.
View all literature mentionsSoftware for gene prediction in eukaryotic genomic sequences. Serves as a basis for further steps in the analysis of sequenced and assembled eukaryotic genomes.
View all literature mentions