This service exclusively searches for literature that cites resources. Please be aware that the total number of searchable documents is limited to those containing RRIDs and does not include all open-access literature.
CRISPR/Cas base editors promise nucleotide-level control over DNA sequences, but the determinants of their activity remain incompletely understood. We measured base editing frequencies in two human cell lines for two cytosine and two adenine base editors at ∼14 000 target sequences and find that base editing activity is sequence-biased, with largest effects from nucleotides flanking the target base. Whether a base is edited depends strongly on the combination of its position in the target and the preceding base, acting to widen or narrow the effective editing window. The impact of features on editing rate depends on the position, with sequence bias efficacy mainly influencing bases away from the center of the window. We use these observations to train a machine learning model to predict editing activity per position, with accuracy ranging from 0.49 to 0.72 between editors, and with better generalization across datasets than existing tools. We demonstrate the usefulness of our model by predicting the efficacy of disease mutation correcting guides, and find that most of them suffer from more unwanted editing than pure outcomes. This work unravels the position-specificity of base editing biases and allows more efficient planning of editing campaigns in experimental and therapeutic contexts.
Hoogsteen (HG) base pairing is characterized by a 180° rotation of the purine base with respect to the Watson-Crick-Franklin (WCF) motif. Recently, it has been found that both conformations coexist in a dynamical equilibrium and that several biological functions require HG pairs. This relevance has motivated experimental and computational investigations of the base-pairing transition. However, a systematic simulation of sequence variations has remained out of reach. Here, we employ advanced path-based methods to perform unprecedented free-energy calculations. Our methodology enables us to study the different mechanisms of purine rotation, either remaining inside or after flipping outside of the double helix. We study seven different sequences, which are neighbor variations of a well-studied A⋅T pair in A6-DNA. We observe the known effect of A⋅T steps favoring HG stability, and find evidence of triple-hydrogen-bonded neighbors hindering the inside transition. More importantly, we identify a dominant factor: the direction of the A rotation, with the 6-ring pointing either towards the longer or shorter segment of the chain, respectively relating to a lower or higher barrier. This highlights the role of DNA's relative flexibility as a modulator of the WCF/HG dynamic equilibrium. Additionally, we provide a robust methodology for future HG proclivity studies.
Helicases couple the chemical energy of ATP hydrolysis to directional translocation along nucleic acids and transient duplex separation. Understanding helicase mechanism requires that the basic physicochemical process of base pair separation be understood. This necessitates monitoring helicase activity directly, at high spatio-temporal resolution. Using optical tweezers with single base pair (bp) resolution, we analyzed DNA unwinding by XPD helicase, a Superfamily 2 (SF2) DNA helicase involved in DNA repair and transcription initiation. We show that monomeric XPD unwinds duplex DNA in 1-bp steps, yet exhibits frequent backsteps and undergoes conformational transitions manifested in 5-bp backward and forward steps. Quantifying the sequence dependence of XPD stepping dynamics with near base pair resolution, we provide the strongest and most direct evidence thus far that forward, single-base pair stepping of a helicase utilizes the spontaneous opening of the duplex. The proposed unwinding mechanism may be a universal feature of DNA helicases that move along DNA phosphodiester backbones. DOI:http://dx.doi.org/10.7554/eLife.00334.001.
A variety of single-gene human diseases are caused by haploinsufficiency, a genetic condition by which mutational inactivation of one allele leads to reduced protein levels and functional impairment. Translational enhancement of the spare allele could exert a therapeutic effect. Here we developed BOOST, a novel gene-editing approach to rescue haploinsufficiency loci by the change of specific single nucleotides in the Kozak sequence, which controls translation by regulating start codon recognition. We evaluated for translational strength 230 Kozak sequences of annotated human haploinsufficient genes and 4621 derived variants, which can be installed by base editing, by a high-throughput reporter assay. Of these variants, 149 increased the translation of 47 Kozak sequences, demonstrating that a substantial proportion of haploinsufficient genes are controlled by suboptimal Kozak sequences. Validation of 18 variants for 8 genes produced an average enhancement in an expression window compatible with the rescue of the genetic imbalance. Base editing of the NCF1 gene, whose monoallelic loss causes chronic granulomatous disease, resulted in the desired increase of NCF1 (p47phox) protein levels in a relevant cell model. We propose BOOST as a fine-tuned approach to modulate translation, applicable to the correction of dozens of haploinsufficient monogenic disorders independently of the causing mutation.
Coding sequence evolution is influenced by both natural selection and neutral evolutionary forces. In many species, the effects of mutation bias, codon usage, and GC-biased gene conversion (gBGC) on gene sequence evolution have not been detailed. Quantification of how these forces shape substitution patterns is therefore necessary to understand the strength and direction of natural selection. Here, we used comparative genomics to investigate the association between base composition and codon usage bias on gene sequence evolution in butterflies and moths (Lepidoptera), including an in-depth analysis of underlying patterns and processes in one species, Leptidea sinapis. The data revealed significant G/C to A/T substitution bias at third codon position with some variation in the strength among different butterfly lineages. However, the substitution bias was lower than expected from previously estimated mutation rate ratios, partly due to the influence of gBGC. We found that A/T-ending codons were overrepresented in most species, but there was a positive association between the magnitude of codon usage bias and GC-content in third codon positions. In addition, the tRNA-gene population in L. sinapis showed higher GC-content at third codon positions compared to coding sequences in general and less overrepresentation of A/T-ending codons. There was an inverse relationship between synonymous substitutions and codon usage bias indicating selection on synonymous sites. We conclude that the evolutionary rate in Lepidoptera is affected by a complex interaction between underlying G/C -> A/T mutation bias and partly counteracting fixation biases, predominantly conferred by overall purifying selection, gBGC, and selection on codon usage.
The flipping-out of a DNA base from the double helical structure is a key step of many cellular processes, such as DNA replication, modification and repair. Base pair opening is the first step of base flipping and the exact mechanism is still not well understood. We investigate sequence effects on base pair opening using extensive classical molecular dynamics simulations targeting the opening of 11 different canonical base pairs in two DNA sequences. Two popular biomolecular force fields are applied. To enhance sampling and calculate free energies, we bias the simulation along a simple distance coordinate using a newly developed adaptive sampling algorithm. The simulation is guided back and forth along the coordinate, allowing for multiple opening pathways. We compare the calculated free energies with those from an NMR study and check assumptions of the model used for interpreting the NMR data. Our results further show that the neighboring sequence is an important factor for the opening free energy, but also indicates that other sequence effects may play a role. All base pairs are observed to have a propensity for opening toward the major groove. The preferred opening base is cytosine for GC base pairs, while for AT there is sequence dependent competition between the two bases. For AT opening, we identify two non-canonical base pair interactions contributing to a local minimum in the free energy profile. For both AT and CG we observe long-lived interactions with water and with sodium ions at specific sites on the open base pair.
Base calling is a critical step in the Solexa next-generation sequencing procedure. It compares the position-specific intensity measurements that reflect the signal strength of four possible bases (A, C, G, T) at each genomic position, and outputs estimates of the true sequences for short reads of DNA or RNA. We present a Bayesian method of base calling, BM-BC, for Solexa-GA sequencing data. The Bayesian method builds on a hierarchical model that accounts for three sources of noise in the data, which are known to affect the accuracy of the base calls: fading, phasing, and cross-talk between channels. We show that the new method improves the precision of base calling compared with currently leading methods. Furthermore, the proposed method provides a probability score that measures the confidence of each base call. This probability score can be used to estimate the false discovery rate of the base calling or to rank the precision of the estimated DNA sequences, which in turn can be useful for downstream analysis such as sequence alignment.
8-Oxoguanine (8-oxoG) is a major product of oxidative DNA damage, which induces replication errors and interferes with transcription. By varying the position of single 8-oxoG in a functional gene and manipulating the nucleotide sequence surrounding the lesion, we found that the degree of transcriptional inhibition is independent of the distance from the transcription start or the localization within the transcribed or the non-transcribed DNA strand. However, it is strongly dependent on the sequence context and also proportional to cellular expression of 8-oxoguanine DNA glycosylase (OGG1)-demonstrating that transcriptional arrest does not take place at unrepaired 8-oxoG and proving a causal connection between 8-oxoG excision and the inhibition of transcription. We identified the 5'-CAGGGC[8-oxoG]GACTG-3' motif as having only minimal transcription-inhibitory potential in cells, based on which we predicted that 8-oxoG excision is particularly inefficient in this sequence context. This anticipation was fully confirmed by direct biochemical assays. Furthermore, in DNA containing a bistranded Cp[8-oxoG]/Cp[8-oxoG] clustered lesion, the excision rates differed between the two strands at least by a factor of 9, clearly demonstrating that the excision preference is defined by the DNA strand asymmetry rather than the overall geometry of the double helix or local duplex stability.
Expanding mitochondrial base editing tools with broad sequence compatibility is of high need for both research and therapeutic purposes. In this study, we identify a DddA homolog from Simiaoa sunii (Ddd_Ss) which can efficiently deaminate cytosine in DC context in double-stranded DNA (dsDNA). We successfully develop Ddd_Ss-derived cytosine base editors (DdCBE_Ss) and introduce mutations at multiple mitochondrial DNA (mtDNA) loci including disease-associated mtDNA mutations in previously inaccessible GC context. Finally, by introducing a single amino acid substitution from Ddd_Ss, we successfully improve the activity and sequence compatibility of DdCBE derived from DddA of Burkholderia cenocepacia (DdCBE_Bc). Our study expands mtDNA editing tool boxes and provides resources for further screening and engineering dsDNA base editors for biological and therapeutic applications.
Base-stacking is a key factor in the energetics that determines nucleic acid structure. We measure the tensile response of single-stranded DNA as a function of sequence and monovalent salt concentration to examine the effects of base-stacking on the mechanical and thermodynamic properties of single-stranded DNA. By comparing the elastic response of highly stacked poly(dA) and that of a polypyrimidine sequence with minimal stacking, we find that base-stacking in poly(dA) significantly enhances the polymer's rigidity. The unstacking transition of poly(dA) at high force reveals that the intrinsic electrostatic tension on the molecule varies significantly more weakly on salt concentration than mean-field predictions. Further, we provide a model-independent estimate of the free energy difference between stacked poly(dA) and unstacked polypyrimidine, finding it to be ∼-0.25 kBT/base and nearly constant over three orders of magnitude in salt concentration.
We present a method for obtaining long haplotypes, of over 3 kb in length, using a short-read sequencer, Barcode-directed Assembly for Extra-long Sequences (BAsE-Seq). BAsE-Seq relies on transposing a template-specific barcode onto random segments of the template molecule and assembling the barcoded short reads into complete haplotypes. We applied BAsE-Seq on mixed clones of hepatitis B virus and accurately identified haplotypes occurring at frequencies greater than or equal to 0.4%, with >99.9% specificity. Applying BAsE-Seq to a clinical sample, we obtained over 9,000 viral haplotypes, which provided an unprecedented view of hepatitis B virus population structure during chronic infection. BAsE-Seq is readily applicable for monitoring quasispecies evolution in viral diseases.
4-Aminobiphenyl (ABP) and its structure analog 2-aminofluorene (AF) are well-known carcinogens. In the present work, an unusual sequence effect in the 5'-CTTCTG1G2TCCTCATTC-3' DNA duplex is reported for ABP- and AF-modified G. Specifically, the ABP modification at G1 resulted in a mixture of 67% major groove B-type (B) and 33% stacked (S) conformers, while at the ABP modification at G2 exclusively resulted in the B-conformer. The AF modification at G1 and G2 lead to 25%:75% and 83%:17% B:S population ratios, respectively. These differences in preferred conformation are due to an interplay between stabilizing (hydrogen bonding and stacking that is enhanced by lesion planarity) and destabilizing (solvent exposure) forces at the lesion site. Furthermore, while the B-conformer is a thermodynamic stabilizer and the S-conformer is a destabilizer in duplex settings, the situation is reversed at the single strands/double strands (ss/ds) junction. Specifically, the twisted biphenyl is a better stacker at the ss/ds junction than the coplanar AF. Therefore, the ABP modification leads to a stronger strand binding affinity of the ss/ds junction than the AF modification. Overall, the current work provides conformational insights into the role of sequence and lesion effects in modulating DNA replication.
Organometallic ruthenium(II) complexes [(η⁶-arene)Ru(en)Cl][PF₆] (arene = benzene (1), p-cymene (2), indane (3), and biphenyl (4); en = ethylenediamine) are promising anticancer drug candidates both in vitro and in vivo. In this paper, the interactions between ruthenium(II) complexes and 15-mer single- and double-stranded oligodeoxynucleotides (ODNs) were thermodynamically investigated using high performance liquid chromatography (HPLC) and electrospray ionization mass spectroscopy (ESI-MS). All of the complexes bind preferentially to G₈ on the single strand 5'-CTCTCTT₇G₈T₉CTTCTC-3' (I), with complex 4 containing the most hydrophobic ligand as the most reactive one. To the analogs of I (changing T₇ and/or T₉ to A and/or C), complex 4 shows a decreasing affinity to the G₈ site in the following order: -AG₈T- (K: 5.74 × 10⁴ M-1) > -CG₈C- > -TG₈A- > -AG₈A- > -AG₈C- > -TG₈T- (I) ≈ -CG₈A- (K: 2.81 × 10⁴ M-1). In the complementary strand of I, the G bases in the middle region are favored for ruthenation over guanine (G) bases in the end of oligodeoxynucleotides (ODNs). These results indicate that both the flanking bases (or base sequences) and the arene ligands play important roles in determining the binding preference, and the base- and sequence-selectivity, of ruthenium complex in binding to the ODNs.
5-Hydroxymethylcytosine (5hmC), 5-formylcytosine (5fC), and 5-carboxylcytosine (5caC) form during active demethylation of 5-methylcytosine (5mC) and are implicated in epigenetic regulation of the genome. They are differentially processed by thymine DNA glycosylase (TDG), an enzyme involved in active demethylation of 5mC. Three modified Dickerson-Drew dodecamer (DDD) sequences, amenable to crystallographic and spectroscopic analyses and containing the 5'-CG-3' sequence associated with genomic cytosine methylation, containing 5hmC, 5fC, or 5caC placed site-specifically into the 5'-T(8)X(9)G(10)-3' sequence of the DDD, were compared. The presence of 5caC at the X(9) base increased the stability of the DDD, whereas 5hmC or 5fC did not. Both 5hmC and 5fC increased imino proton exchange rates and calculated rate constants for base pair opening at the neighboring base pair A(5):T(8), whereas 5caC did not. At the oxidized base pair G(4):X(9), 5fC exhibited an increase in the imino proton exchange rate and the calculated kop. In all cases, minimal effects to imino proton exchange rates occurred at the neighboring base pair C(3):G(10). No evidence was observed for imino tautomerization, accompanied by wobble base pairing, for 5hmC, 5fC, or 5caC when positioned at base pair G(4):X(9); each favored Watson-Crick base pairing. However, both 5fC and 5caC exhibited intranucleobase hydrogen bonding between their formyl or carboxyl oxygens, respectively, and the adjacent cytosine N(4) exocyclic amines. The lesion-specific differences observed in the DDD may be implicated in recognition of 5hmC, 5fC, or 5caC in DNA by TDG. However, they do not correlate with differential excision of 5hmC, 5fC, or 5caC by TDG, which may be mediated by differences in transition states of the enzyme-bound complexes.
The consensus sequence of p53 is repeated half sites of PuPuPuC(A/T)(A/T)GPyPyPy. GtAGCAttAGCCCAGACATGTCC is a 14-3-3sigma promoter p53 regulation site; the first core sequence is CAttAG, and the second is CATG. Both mutants GtAGgAttAGCCCAGACATGTCC and GtAGCAttAGCCCAGACATcTCC can be activated by p53 as a 1.5-fold half site. The original p53 regulated site on the 14-3-3sigma promoter is a whole site, and CATTAG is a functional core sequence. The p53-binding affinity and the activity of CATTAG were lower than for the mutant CATATG core sequence. Wild-type p53 acts as a tetramer to bind to the whole site; however, it also can bind to a half site by one of its dimers. Wild-type p53 can only bind to a half site with core sequence CATG but not to CATATG. The 1.5-fold half site or whole site with core sequence CATATG can be bound by wild-type p53. A p53 mutant, A344, forms dimeric p53; it can only bind to CATG, and not to CATATG. Therefore, tetrameric and dimeric p53 can bind to a two-base A/T gap core sequence, but only tetrameric p53 can bind to a four-base A/T gap core sequence.
The programmable sequence specificity of CRISPR has found uses in gene editing and diagnostics. This manuscript describes an additional application of CRISPR through a family of novel DNA enrichment technologies. CAMP (CRISPR Associated Multiplexed PCR) and cCAMP (chimeric CRISPR Associated Multiplexed PCR) utilize the sequence specificity of the Cas9/sgRNA complex to target loci for the ligation of a universal adapter that is used for subsequent amplification. cTRACE (chimeric Targeting Rare Alleles with CRISPR-based Enrichment) also applies this method to use Cas9/sgRNA to target loci for the addition of universal adapters, however it has an additional selection for specific mutations through the use of an allele-specific primer. These three methods can produce multiplex PCR that significantly reduces the optimization required for every target. The methods are also not specific to any downstream analytical platform. We additionally will present a mutation specific enrichment technology that is non-amplification based and leaves the DNA in its native state: TRACE (Targeting Rare Alleles with CRISPR-based Enrichment). TRACE utilizes the Cas9/sgRNA complex to sterically protect the ends of targeted sequences from exonuclease activity which digests both the normal variant as well as any off-target sequences.
Pyrrolo[2,1-c][1,4]benzodiazepine (PBD) dimers are synthetic sequence-selective DNA minor-groove cross-linking agents that possess two electrophilic imine moieties (or their equivalent) capable of forming covalent aminal linkages with guanine C2-NH(2) functionalities. The PBD dimer SJG-136, which has a C8-O-(CH(2))(3)-O-C8'' central linker joining the two PBD moieties, is currently undergoing phase II clinical trials and current research is focused on developing analogues of SJG-136 with different linker lengths and substitution patterns. Using a reversed-phase ion pair HPLC/MS method to evaluate interaction with oligonucleotides of varying length and sequence, we recently reported (JACS, 2009, 131, 13 756) that SJG-136 can form three different types of adducts: inter- and intrastrand cross-linked adducts, and mono-alkylated adducts. These studies have now been extended to include PBD dimers with a longer central linker (C8-O-(CH(2))(5)-O-C8'), demonstrating that the type and distribution of adducts appear to depend on (i) the length of the C8/C8'-linker connecting the two PBD units, (ii) the positioning of the two reactive guanine bases on the same or opposite strands, and (iii) their separation (i.e. the number of base pairs, usually ATs, between them). Based on these data, a set of rules are emerging that can be used to predict the DNA-interaction behaviour of a PBD dimer of particular C8-C8' linker length towards a given DNA sequence. These observations suggest that it may be possible to design PBD dimers to target specific DNA sequences.
DNA methylation maintenance by DNMT1 is an essential process in mammals but molecular mechanisms connecting DNA methylation patterns and enzyme activity remain elusive. Here, we systematically analyzed the specificity of DNMT1, revealing a pronounced influence of the DNA sequences flanking the target CpG site on DNMT1 activity. We determined DNMT1 structures in complex with preferred DNA substrates revealing that DNMT1 employs flanking sequence-dependent base flipping mechanisms, with large structural rearrangements of the DNA correlating with low catalytic activity. Moreover, flanking sequences influence the conformational dynamics of the active site and cofactor binding pocket. Importantly, we show that the flanking sequence preferences of DNMT1 highly correlate with genomic methylation in human and mouse cells, and 5-azacytidine triggered DNA demethylation is more pronounced at CpG sites with flanks disfavored by DNMT1. Overall, our findings uncover the intricate interplay between CpG-flanking sequence, DNMT1-mediated base flipping and the dynamic landscape of DNA methylation.
Base J (β-D-glucosyl-hydroxymethyluracil) replaces 1% of T in the Leishmania genome and is only found in telomeric repeats (99%) and in regions where transcription starts and stops. This highly restricted distribution must be co-determined by the thymidine hydroxylases (JBP1 and JBP2) that catalyze the initial step in J synthesis. To determine the DNA sequences recognized by JBP1/2, we used SMRT sequencing of DNA segments inserted into plasmids grown in Leishmania tarentolae. We show that SMRT sequencing recognizes base J in DNA. Leishmania DNA segments that normally contain J also picked up J when present in the plasmid, whereas control sequences did not. Even a segment of only 10 telomeric (GGGTTA) repeats was modified in the plasmid. We show that J modification usually occurs at pairs of Ts on opposite DNA strands, separated by 12 nucleotides. Modifications occur near G-rich sequences capable of forming G-quadruplexes and JBP2 is needed, as it does not occur in JBP2-null cells. We propose a model whereby de novo J insertion is mediated by JBP2. JBP1 then binds to J and hydroxylates another T 13 bp downstream (but not upstream) on the complementary strand, allowing JBP1 to maintain existing J following DNA replication.
The essential deamination of adenosine A34 to inosine at the wobble base is the individual tRNA modification with the greatest effects on mRNA decoding, empowering a single tRNA to translate three different codons. To date, many aspects of how eukaryotic deaminases specifically select their multiple substrates remain unclear. Here, using cryo-EM, we present the structure of a eukaryotic ADAT2/3 deaminase bound to a full-length tRNA, revealing that the enzyme distorts the anticodon loop, but in contrast to the bacterial enzymes, selects its substrate via sequence-independent contacts of eukaryote-acquired flexible or intrinsically unfolded motifs distal from the conserved catalytic core. A gating mechanism for substrate entry to the active site is identified. Our multi-step tRNA recognition model yields insights into how RNA editing by A34 deamination evolved, shaped the genetic code, and directly impacts the eukaryotic proteome.
Welcome to the FDI Lab - SciCrunch.org Resources search. From here you can search through a compilation of resources used by FDI Lab - SciCrunch.org and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that FDI Lab - SciCrunch.org has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on FDI Lab - SciCrunch.org then you can log in from here to get additional features in FDI Lab - SciCrunch.org such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into FDI Lab - SciCrunch.org you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the facets that you can filter your papers by.
From here we'll present any options for the literature, such as exporting your current results.
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.
Year:
Count: