Small RNAs regulate gene expression and most genes in the worm Caenorhabditis elegans are subject to their regulation. Here, we analyze small RNA data sets and use reproducible features of RNAs present in multiple data sets to discover a new class of small RNAs and to reveal insights into two known classes of small RNAs--22G RNAs and 26G RNAs. We found that reproducibly detected 22-nt RNAs, although are predominantly RNAs with a G at the 5' end, also include RNAs with A, C, or U at the 5' end. These RNAs are synthesized downstream from characteristic sequence motifs on mRNA and have U-tailed derivatives. Analysis of 26G RNAs revealed that they are processed from a blunt end of double-stranded RNAs and that production of one 26G RNA generates a hotspot immediately downstream for production of another. To our surprise, analysis of RNAs shorter than 18 nt revealed a new class of RNAs, which we call NU RNAs (pronounced "new RNAs") because they have a NU bias at the 5' end, where N is any nucleotide. NU RNAs are antisense to genes and originate downstream from U bases on mRNA. Although many genes have complementary NU RNAs, their genome-wide distribution is distinct from that of previously known classes of small RNAs. Our results suggest that current approaches underestimate reproducibly detected RNAs that are shorter than 18 nt, and theoretical considerations suggest that such shorter RNAs could be used for sequence-specific gene regulation in organisms like C. elegans that have small genomes.
Pubmed ID: 26647462 RIS Download
Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.
Software tool as collection of command line tools for Short-Reads FASTA/FASTQ files preprocessing.
View all literature mentionsOpen, web-based platform providing bioinformatics tools and services for data intensive genomic research. Platform may be used as a service or installed locally to perform, reproduce, and share complete analyses. Galaxy automatically tracks and manages data provenance and provides support for capturing the context and intent of computational methods. Galaxy Community has created Galaxy instances in many different forms and for many different applications including Galaxy servers, cloud services that support Galaxy instances, and virtual machines and containers that can be easily deployed for your own server.The Galaxy team is a part of BX at Penn State, and the Biology and Mathematics and Computer Science departments at Emory University.Training Infrastructure as a Service (TIaaS) is a service offered by some UseGalaxy servers to specifically support training use cases.
View all literature mentionsPython 2D plotting library which produces publication quality figures in variety of hardcopy formats and interactive environments across platforms. Used in python scripts, web application servers, and six graphical user interface toolkits. Used to generate plots, histograms, power spectra, bar charts, error charts, scatter plots.
View all literature mentionsWeb application to generate sequence logos, graphical representations of patterns within multiple sequence alignment. Designed to make generation of sequence logos easy. Sequence logo generator.
View all literature mentionsA Python library which provides Python access to and interaction with Galaxy's API and CloudMan. The library allows users to create a CloudMan compute cluster via an API and directly from a local machine, reconnect to an existing CloudMan instance and manipulate it, and interact with Galaxy via a straightforward API and an object-oriented API. The library itself can be used with either service irrespective of the other.
View all literature mentions