2024MAY03: Our hosting provider has resolved some DB connectivity issues. We may experience some more outages as the issue is resolved. We apologize for the inconvenience. Dismiss and don't show again

Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Miniature Inverted-repeat Transposable Elements Drive Rapid MicroRNA Diversification in Angiosperms.

Molecular biology and evolution | 2022

MicroRNAs (miRNAs) are fast evolving endogenous small RNAs that regulate organism function and behavior in both animals and plants. Although models for de novo miRNA biogenesis have been proposed, the genomic mechanisms driving swift diversification of the miRNA repertoires in plants remain elusive. Here, by comprehensively analyzing 21 phylogenetically representative plant species, ranging from green algae to angiosperms, we systematically identified de novo miRNA events associated with 8,649 miRNA loci. We found that 399 (4.6%), 466 (5.4%), and 1,402 (16.2%) miRNAs were derived from inverted gene duplication events, long terminal repeats of retrotransposons, and miniature inverted-repeat transposable elements (MITEs), respectively. Among the miRNAs of these origins, MITEs, especially those belonging to the Mutator, Tc1/Mariner, and PIF/Harbinger superfamilies, were the predominant genomic source for de novo miRNAs in the 15 examined angiosperms but not in the six non-angiosperms. Our data further illustrated a transposition-transcription process by which MITEs are converted into new miRNAs (termed MITE-miRNAs) whereby properly sized MITEs are transcribed and therefore become potential substrates for the miRNA processing machinery by transposing into introns of active genes. By analyzing the 58,038 putative target genes for the 8,095 miRNAs, we found that the target genes of MITE-miRNAs were preferentially associated with response to environmental stimuli such as temperature, suggesting that MITE-miRNAs are pertinent to plant adaptation. Collectively, these findings demonstrate that molecular conversion of MITEs is a genomic mechanism leading to rapid and continuous changes to the miRNA repertoires in angiosperm.

Pubmed ID: 36223453 RIS Download

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


BLASTN (tool)

RRID:SCR_001598

Web application to search nucleotide databases using a nucleotide query. Algorithms: blastn, megablast, discontiguous megablast.

View all literature mentions

miRBase (tool)

RRID:SCR_003152

Central online repository for microRNA nomenclature, sequence data, annotation and target prediction.Collection of published miRNA sequences and annotation.

View all literature mentions

GigaDB (tool)

RRID:SCR_004002

Repository to host data and tools associated with articles in GigaScience; however, it also includes a subset of datasets that are not associated with GigaScience articles. GigaDB defines a dataset as a group of files (e.g., sequencing data, analyses, imaging files, software programs) that are related to and support an article or study. Through their association with DataCite, each dataset will be assigned a DOI that can be used as a standard citation for future use of these data in other articles by the authors and other researchers. Datasets in GigaDB all require a title that is specific to the dataset, an author list, and an abstract that provides information specific to the data included within the set. Detailed information about the data to be submitted is encouraged in ISA-Tab, a format used by the BioSharing and ISA Commons communities that they work with to maintain the highest data and metadata standards in their journal.

View all literature mentions

SourceForge (tool)

RRID:SCR_004365

Web based service that offers software developers centralized online location to control and manage free and open source software projects. Open source software tool and business public software platform.

View all literature mentions

NCBI BioProject (tool)

RRID:SCR_004801

Database of biological data related to a single initiative, originating from a single organization or from a consortium. A BioProject record provides users a single place to find links to the diverse data types generated for that project. It is a searchable collection of complete and incomplete (in-progress) large-scale sequencing, assembly, annotation, and mapping projects for cellular organisms. Submissions are supported by a web-based Submission Portal. The database facilitates organization and classification of project data submitted to NCBI, EBI and DDBJ databases that captures descriptive information about research projects that result in high volume submissions to archival databases, ties together related data across multiple archives and serves as a central portal by which to inform users of data availability. BioProject records link to corresponding data stored in archival repositories. The BioProject resource is a redesigned, expanded, replacement of the NCBI Genome Project resource. The redesign adds tracking of several data elements including more precise information about a project''''s scope, material, and objectives. Genome Project identifiers are retained in the BioProject as the ID value for a record, and an Accession number has been added. Database content is exchanged with other members of the International Nucleotide Sequence Database Collaboration (INSDC). BioProject is accessible via FTP.

View all literature mentions

Bowtie (tool)

RRID:SCR_005476

Software ultrafast memory efficient tool for aligning sequencing reads. Bowtie is short read aligner.

View all literature mentions

InterPro (tool)

RRID:SCR_006695

Service providing functional analysis of proteins by classifying them into families and predicting domains and important sites. They combine protein signatures from a number of member databases into a single searchable resource, capitalizing on their individual strengths to produce a powerful integrated database and diagnostic tool. This integrated database of predictive protein signatures is used for the classification and automatic annotation of proteins and genomes. InterPro classifies sequences at superfamily, family and subfamily levels, predicting the occurrence of functional domains, repeats and important sites. InterPro adds in-depth annotation, including GO terms, to the protein signatures. You can access the data programmatically, via Web Services. The member databases use a number of approaches: # ProDom: provider of sequence-clusters built from UniProtKB using PSI-BLAST. # PROSITE patterns: provider of simple regular expressions. # PROSITE and HAMAP profiles: provide sequence matrices. # PRINTS provider of fingerprints, which are groups of aligned, un-weighted Position Specific Sequence Matrices (PSSMs). # PANTHER, PIRSF, Pfam, SMART, TIGRFAMs, Gene3D and SUPERFAMILY: are providers of hidden Markov models (HMMs). Your contributions are welcome. You are encouraged to use the ''''Add your annotation'''' button on InterPro entry pages to suggest updated or improved annotation for individual InterPro entries.

View all literature mentions

FASTA (tool)

RRID:SCR_011819

Software package for DNA and protein sequence alignment to find regions of local or global similarity between Protein or DNA sequences, either by searching Protein or DNA databases, or by identifying local duplications within a sequence.

View all literature mentions

Trim Galore (tool)

RRID:SCR_011847

Software tool to automate quality and adapter trimming as well as quality control, with some added functionality to remove biased methylation positions for RRBS sequence files for directional, non-directional or paired-end sequencing. Wrapper around Cutadapt and FastQC to consistently apply adapter and quality trimming to FastQ files, with extra functionality for Reduced Representation Bisulfite Sequencing data.

View all literature mentions

BLAT (tool)

RRID:SCR_011919

Software designed to quickly find sequences of 95% and greater similarity of length 25 bases or more.

View all literature mentions

RepeatMasker (tool)

RRID:SCR_012954

Software tool that screens DNA sequences for interspersed repeats and low complexity DNA sequences. The output of the program is a detailed annotation of the repeats that are present in the query sequence as well as a modified version of the query sequence in which all the annotated repeats have been masked (default: replaced by Ns). Currently over 56% of human genomic sequence is identified and masked by the program. Sequence comparisons in RepeatMasker are performed by one of several popular search engines including nhmmer, cross_match, ABBlast/WUBlast, RMBlast and Decypher. RepeatMasker makes use of curated libraries of repeats and currently supports Dfam ( profile HMM library ) and RepBase ( consensus sequence library ).

View all literature mentions

psRNATarget (tool)

RRID:SCR_013321

A plant small RNA target analysis server which features two important analysis functions: 1) reverse complementary matching between miRNA and target transcript using a proven scoring schema, and 2) target site accessibility evaluation by calculating unpaired energy (UPE) required to ?open? secondary structure around miRNA?s target site on mRNA. PsRNATarget incorporates recent discoveries in plant miRNA target recognition, e.g. it distinguishes translational and post-transcriptional inhibition, and it reports the number of miRNA/target site pairs that may affect miRNA binding activity to target transcript. PsRNATarget is designed for high-throughput analysis of next-generation data with an efficient distributed computing back-end pipeline that runs on a Linux cluster. The server front-end integrates three simplified user-friendly interfaces to accept user-submitted or preloaded miRNAs and transcript sequences; and outputs a comprehensive list of miRNA / target pairs along with the online tools for batch downloading, key word searching and results sorting.

View all literature mentions

Cufflinks (tool)

RRID:SCR_014597

Software tool for transcriptome assembly and differential expression analysis for RNA-Seq. Includes script called cuffmerge that can be used to merge together several Cufflinks assemblies. It also handles running Cuffcompare as well as automatically filtering a number of transfrags that are likely to be artifacts. If the researcher has a reference GTF file, the researcher can provide it to the script to more effectively merge novel isoforms and maximize overall assembly quality.

View all literature mentions

LTR_Finder (tool)

RRID:SCR_015247

Web software capable of scanning large-scale sequences for full-length LTR retrotranspsons.

View all literature mentions

TimeTree (tool)

RRID:SCR_021162

Public knowledge base for information on evolutionary timescale of life. Data from thousands of published studies are assembled into searchable tree of life scaled to time.

View all literature mentions