Paralogous transcription factors (TFs) are often reported to have identical DNA-binding motifs, even though they perform distinct regulatory functions. Differential genomic targeting by paralogous TFs is generally assumed to be due to interactions with protein co-factors or the chromatin environment. Using a computational-experimental framework called iMADS (integrative modeling and analysis of differential specificity), we show that, contrary to previous assumptions, paralogous TFs bind differently to genomic target sites even in vitro. We used iMADS to quantify, model, and analyze specificity differences between 11 TFs from 4 protein families. We found that paralogous TFs have diverged mainly at medium- and low-affinity sites, which are poorly captured by current motif models. We identify sequence and shape features differentially preferred by paralogous TFs, and we show that the intrinsic differences in specificity among paralogous TFs contribute to their differential in vivo binding. Thus, our study represents a step forward in deciphering the molecular mechanisms of differential specificity in TF families.
PubMed ID: 29605182
This unknown targets GST Tag
Software environment and programming language for statistical computing and graphics. R is an integrated suite of software facilities for data manipulation, calculation, and graphical display. It can be extended via packages; some are supplied with the R distribution and more are available through the CRAN family. It compiles and runs on a wide variety of UNIX platforms, Windows, and macOS.
Python is a programming language for all operating systems that lets users work more quickly and integrate their systems more effectively. It is often compared to Tcl, Perl, Ruby, Scheme, or Java. Its key distinguishing features include very clear and readable syntax, strong introspection capabilities, intuitive object orientation, natural expression of procedural code, full modularity, exception-based error handling, high-level dynamic data types, and extensive standard libraries and third-party modules for virtually every task. Extensions and modules are easily written in C or C++ (or Java for Jython, or .NET languages for IronPython), and the language is embeddable within applications as a scripting interface.