Modeling and Predicting the Activities of Trans-Acting Splicing Factors with Machine Learning.

Cell Systems | 2018

Alternative splicing (AS) is generally regulated by trans-acting splicing factors that specifically bind to cis-elements in pre-mRNAs. The human genome encodes ∼1,500 RNA binding proteins (RBPs) that potentially regulate AS, yet their functions remain largely unknown. To explore their potential activities, we fused the putative functional domains of RBPs to a sequence-specific RNA-binding domain and systematically analyzed how these engineered factors affect splicing. We discovered that ∼80% of low-complexity domains in endogenous RBPs displayed distinct context-dependent activities in regulating splicing, indicating that AS is under more extensive regulation than previously expected. We developed a machine learning approach to classify and predict the activities of RBPs based on their sequence compositions and further validated this model using endogenous RBPs and synthetic polypeptides. These results represent a systematic inspection, modeling, prediction, and validation of how RBP sequences affect their activities in controlling splicing, paving the way for de novo engineering of artificial splicing factors.
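
The abstract refers to predicting activity from "sequence compositions" without giving details here. As a rough illustration only, the sketch below computes one common kind of composition feature, the fraction of each standard amino acid in a protein sequence. It is an assumption about what such a feature could look like, not the authors' actual model or feature set; the function name `composition` and the example peptide are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids, in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def composition(seq: str) -> dict[str, float]:
    """Return the fraction of each standard amino acid in `seq`.

    Hypothetical helper for illustration; non-standard characters are ignored.
    """
    counts = Counter(ch for ch in seq.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    # Residues absent from the sequence get a fraction of 0.0.
    return {aa: counts[aa] / total for aa in AMINO_ACIDS}


# Hypothetical glycine-rich peptide, used only as example input.
features = composition("GGSGGRGGYG")
```

A vector like `features` (20 values summing to 1) could in principle be passed to any standard classifier; which features and classifier the publication used is not stated in this record.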

PubMed ID: 30414922

Publication data is provided by the National Library of Medicine® and PubMed®. Data is retrieved from PubMed® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


New England Biolabs (tool)

RRID:SCR_013517

An antibody supplier

Anti-rabbit IgG, HRP-linked Antibody (antibody)

RRID:AB_2099233

This polyclonal secondary targets IgG

goat anti-rabbit IgG-HRP (antibody)

RRID:AB_631746

This polyclonal targets goat anti-rabbit IgG-HRP

GAPDH antibody (antibody)

RRID:AB_2737588

This monoclonal targets GAPDH

GAPDH Antibody (FL-335) (antibody)

RRID:AB_10167668

This polyclonal targets GAPDH

Biopython (software development tool)

RRID:SCR_007173

Biopython is a set of freely available tools for biological computation written in Python by an international team of developers. It is a distributed collaborative effort to develop Python libraries and applications that address the needs of current and future work in bioinformatics. The source code is made available under the Biopython License, which is extremely liberal and compatible with almost every license in the world. The project works with the Open Bioinformatics Foundation, which generously hosts its website, bug tracker, and mailing lists. Sponsor: This resource is supported by the Open Bioinformatics Foundation. Keywords: Tool, Software, Python, Biological, Computation, Bioinformatics

Image Lab Software (software resource)

RRID:SCR_014210

Imaging software used to acquire and analyze images from specific Bio-Rad imaging systems. Users can analyze gel or blot features, capture optimized image data, and generate a report of the data. Image Lab software runs exclusively on the Gel Doc EZ imager, Gel Doc XR+ imaging system, ChemiDoc MP and ChemiDoc XRS+ imaging systems, Criterion Stain Free imager, and the GS-900 calibrated densitometer.

Python Programming Language (software resource)

RRID:SCR_008394

Programming language for all operating systems that lets users work more quickly and integrate their systems more effectively. Often compared to Tcl, Perl, Ruby, Scheme, or Java. Some of its key distinguishing features include very clear and readable syntax, strong introspection capabilities, intuitive object orientation, natural expression of procedural code, full modularity, exception-based error handling, high-level dynamic data types, extensive standard libraries and third-party modules for virtually every task, extensions and modules easily written in C, C++ (or Java for Jython, or .NET languages for IronPython), and embeddability within applications as a scripting interface.

ImageQuant (data processing software)

RRID:SCR_014246

Software for automatic general image analysis. It provides fully automatic analysis of 1-D gels including lane creation, background subtraction, band detection, molecular weight calibration, quantity calibration, and normalization. Editing tools are provided for cropping, rotating, and filtering images.

R Project for Statistical Computing (software resource)

RRID:SCR_001905

Software environment and programming language for statistical computing and graphics. R is an integrated suite of software facilities for data manipulation, calculation, and graphical display. It can be extended via packages. Some packages are supplied with the R distribution and more are available through the CRAN family. It compiles and runs on a wide variety of UNIX platforms, Windows, and MacOS.

HEK293T (cell line)

RRID:CVCL_0063

HEK293T is a transformed cell line of Homo sapiens (human) origin.

RStudio (software resource)

RRID:SCR_000432

Open-source and enterprise-ready professional software for the R statistical computing environment. Integrated development environment for R. Includes a console and a syntax-highlighting editor that supports direct code execution, as well as tools for plotting, history, debugging, and workspace management. Available in open-source and commercial editions; runs on desktop Windows, Mac, and Linux, or in a browser connected to RStudio Server or RStudio Server Pro (Debian/Ubuntu, RedHat/CentOS, and SUSE Linux).
