Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

The origin and impeded dissemination of the DNA phosphorothioation system in prokaryotes.

Nature communications | 2021

Phosphorothioate (PT) modification by the dnd gene cluster is the first identified DNA backbone modification and constitute an epigenetic system with multiple functions, including antioxidant ability, restriction modification, and virus resistance. Despite these advantages for hosting dnd systems, they are surprisingly distributed sporadically among contemporary prokaryotic genomes. To address this ecological paradox, we systematically investigate the occurrence and phylogeny of dnd systems, and they are suggested to have originated in ancient Cyanobacteria after the Great Oxygenation Event. Interestingly, the occurrence of dnd systems and prophages is significantly negatively correlated. Further, we experimentally confirm that PT modification activates the filamentous phage SW1 by altering the binding affinity of repressor and the transcription level of its encoding gene. Competition assays, concurrent epigenomic and transcriptomic sequencing subsequently show that PT modification affects the expression of a variety of metabolic genes, which reduces the competitive fitness of the marine bacterium Shewanella piezotolerans WP3. Our findings strongly suggest that a series of negative effects on microorganisms caused by dnd systems limit horizontal gene transfer, thus leading to their sporadic distribution. Overall, our study reveals putative evolutionary scenario of the dnd system and provides novel insights into the physiological and ecological influences of PT modification.

Pubmed ID: 34737280 RIS Download

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


RefSeq (tool)

RRID:SCR_003496

Collection of curated, non-redundant genomic DNA, transcript RNA, and protein sequences produced by NCBI. Provides a reference for genome annotation, gene identification and characterization, mutation and polymorphism analysis, expression studies, and comparative analyses. Accessed through the Nucleotide and Protein databases.

View all literature mentions

RAxML (tool)

RRID:SCR_006086

Software program for phylogenetic analyses of large datasets under maximum likelihood.

View all literature mentions

SciPy (tool)

RRID:SCR_008058

A Python-based environment of open-source software for mathematics, science, and engineering. The core packages of SciPy include: NumPy, a base N-dimensional array package; SciPy Library, a fundamental library for scientific computing; and IPython, an enhanced interactive console.

View all literature mentions

DEGseq (tool)

RRID:SCR_008480

R package to identify differentially expressed genes from RNA-Seq data.

View all literature mentions

DIAMOND (tool)

RRID:SCR_009457

Software to: view dicom files and assemble them into 3D volumes. View and convert between Analyze, Nifti, and Interfile. Classify and organize dicoms and 3D volumes using metadata. Search and report on a collection of scans.

View all literature mentions

MAFFT (tool)

RRID:SCR_011811

Software package as multiple alignment program for amino acid or nucleotide sequences. Can align up to 500 sequences or maximum file size of 1 MB. First version of MAFFT used algorithm based on progressive alignment, in which sequences were clustered with help of Fast Fourier Transform. Subsequent versions have added other algorithms and modes of operation, including options for faster alignment of large numbers of sequences, higher accuracy alignments, alignment of non-coding RNA sequences, and addition of new sequences to existing alignments.

View all literature mentions

KEGG (tool)

RRID:SCR_012773

Integrated database resource consisting of 16 main databases, broadly categorized into systems information, genomic information, and chemical information. In particular, gene catalogs in completely sequenced genomes are linked to higher-level systemic functions of cell, organism, and ecosystem. Analysis tools are also available. KEGG may be used as reference knowledge base for biological interpretation of large-scale datasets generated by sequencing and other high-throughput experimental technologies.

View all literature mentions

edgeR (tool)

RRID:SCR_012802

Bioconductor software package for Empirical analysis of Digital Gene Expression data in R. Used for differential expression analysis of RNA-seq and digital gene expression data with biological replication.

View all literature mentions

New England Biolabs (tool)

RRID:SCR_013517

An Antibody supplier

View all literature mentions

Agilent Technologies (tool)

RRID:SCR_013575

Company provides laboratories worldwide with analytical instruments and supplies, clinical and diagnostic testing services, consumables, applications and expertise in life sciences and applied chemical markets.

View all literature mentions

Primer Express (tool)

RRID:SCR_014326

Software that allows users to manually or automatically design custom primers and probes for gene quantitation and allelic discrimination (SNP) real-time PCR applications. It supports assays based on TaqMan and SYBR Green I dye chemistries.

View all literature mentions

FastTree (tool)

RRID:SCR_015501

Source code that infers approximately-maximum-likelihood phylogenetic trees from alignments of nucleotide or protein sequences. It uses the Jukes-Cantor or generalized time-reversible (GTR) models of nucleotide evolution and the JTT, WAG, or LG models of amino acid evolution.

View all literature mentions

HISAT2 (tool)

RRID:SCR_015530

Graph-based alignment of next generation sequencing reads to a population of genomes.

View all literature mentions