Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

A route to de novo domestication of wild allotetraploid rice.

Cell | 2021

Cultivated rice varieties are all diploid, and polyploidization of rice has long been desired because of its advantages in genome buffering, vigorousness, and environmental robustness. However, a workable route remains elusive. Here, we describe a practical strategy, namely de novo domestication of wild allotetraploid rice. By screening allotetraploid wild rice inventory, we identified one genotype of Oryza alta (CCDD), polyploid rice 1 (PPR1), and established two important resources for its de novo domestication: (1) an efficient tissue culture, transformation, and genome editing system and (2) a high-quality genome assembly discriminated into two subgenomes of 12 chromosomes apiece. With these resources, we show that six agronomically important traits could be rapidly improved by editing O. alta homologs of the genes controlling these traits in diploid rice. Our results demonstrate the possibility that de novo domesticated allotetraploid rice can be developed into a new staple cereal to strengthen world food security.

Pubmed ID: 33539781 RIS Download

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


STRUCTURE (tool)

RRID:SCR_002151

Software package for using multi locus genotype data to investigate population structure. Used for inferring presence of distinct populations, assigning individuals to populations, studying hybrid zones, identifying migrants and admixed individuals, and estimating population allele frequencies in situations where many individuals are migrants or admixed. Can be applied to most of commonly used genetic markers, including SNPS, microsatellites, RFLPs and Amplified Fragment Length Polymorphisms.

View all literature mentions

SnpEff (tool)

RRID:SCR_005191

Genetic variant annotation and effect prediction software toolbox that annotates and predicts effects of variants on genes (such as amino acid changes). By using standards, such as VCF, SnpEff makes it easy to integrate with other programs.

View all literature mentions

InterProScan (tool)

RRID:SCR_005829

Software package for functional analysis of sequences by classifying them into families and predicting presence of domains and sites. Scans sequences against InterPro's signatures. Characterizes nucleotide or protein function by matching it with models from several different databases. Used in large scale analysis of whole proteomes, genomes and metagenomes. Available as Web based version and standalone Perl version and SOAP Web Service.

View all literature mentions

PHYLIP (tool)

RRID:SCR_006244

A free package of software programs for inferring phylogenies (evolutionary trees). The source code is distributed (in C), and executables are also distributed. In particular, already-compiled executables are available for Windows (95/98/NT/2000/me/xp/Vista), Mac OS X, and Linux systems. Older executables are also available for Mac OS 8 or 9 systems.

View all literature mentions

OrthoMCL DB: Ortholog Groups of Protein Sequences (tool)

RRID:SCR_007839

OrthoMCL is a genome-scale algorithm for grouping orthologous protein sequences. It provides not only groups shared by two or more species/genomes, but also groups representing species-specific gene expansion families. OrthoMCL starts with reciprocal best hits within each genome as putative in-paralog/recent paralog pairs and reciprocal best hits across any two genomes as putative ortholog pairs. Related proteins are interlinked in a similarity graph. Then MCL (Markov Clustering algorithm,Van Dongen 2000; www.micans.org/mcl) is invoked to split mega-clusters. This process is analogous to the manual review in COG construction. MCL clustering is based on weights between each pair of proteins, so to correct for differences in evolutionary distance the weights are normalized before running MCL.

View all literature mentions

SNAP (tool)

RRID:SCR_007936

A sequence analysis tool providing a simple but detailed analysis of human genes and their variations. For each gene, a gene-gene relationship network can be generated based on protein-protein interaction data, metabolic pathway connections and extended through phylogenetic relations. Snap provides tools for designing sequence primers and evaluating RNA splicing effects of single SNPs - known from the databases or defined by you. Primers can be designed for the amplification or sequencing of cDNA, genomic DNA, introns only or exons only.

View all literature mentions

tRNAscan-SE (tool)

RRID:SCR_010835

Web server to search for tRNA genes in genomic sequence. If you would like to run tRNAscan-SE locally, you can get the UNIX source code (gzip''d tar file).

View all literature mentions

BWA (tool)

RRID:SCR_010910

Software for aligning sequencing reads against large reference genome. Consists of three algorithms: BWA-backtrack, BWA-SW and BWA-MEM. First for sequence reads up to 100bp, and other two for longer sequences ranged from 70bp to 1Mbp.

View all literature mentions

Circos (tool)

RRID:SCR_011798

A software package for visualizing data and information. It visualizes data in a circular layout - this makes Circos ideal for exploring relationships between objects or positions.

View all literature mentions

Infernal (tool)

RRID:SCR_011809

Software for searching DNA sequence databases for RNA structure and sequence similarities.

View all literature mentions

SAM format (tool)

RRID:SCR_012093

A generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms.

View all literature mentions

RepeatMasker (tool)

RRID:SCR_012954

Software tool that screens DNA sequences for interspersed repeats and low complexity DNA sequences. The output of the program is a detailed annotation of the repeats that are present in the query sequence as well as a modified version of the query sequence in which all the annotated repeats have been masked (default: replaced by Ns). Currently over 56% of human genomic sequence is identified and masked by the program. Sequence comparisons in RepeatMasker are performed by one of several popular search engines including nhmmer, cross_match, ABBlast/WUBlast, RMBlast and Decypher. RepeatMasker makes use of curated libraries of repeats and currently supports Dfam ( profile HMM library ) and RepBase ( consensus sequence library ).

View all literature mentions

TopHat (tool)

RRID:SCR_013035

Software tool for fast and high throughput alignment of shotgun cDNA sequencing reads generated by transcriptomics technologies. Fast splice junction mapper for RNA-Seq reads. Aligns RNA-Seq reads to mammalian-sized genomes using ultra high-throughput short read aligner Bowtie, and then analyzes mapping results to identify splice junctions between exons.TopHat2 is accurate alignment of transcriptomes in presence of insertions, deletions and gene fusions.

View all literature mentions

Cufflinks (tool)

RRID:SCR_014597

Software tool for transcriptome assembly and differential expression analysis for RNA-Seq. Includes script called cuffmerge that can be used to merge together several Cufflinks assemblies. It also handles running Cuffcompare as well as automatically filtering a number of transfrags that are likely to be artifacts. If the researcher has a reference GTF file, the researcher can provide it to the script to more effectively merge novel isoforms and maximize overall assembly quality.

View all literature mentions

RepeatScout (tool)

RRID:SCR_014653

Algorithm used to identify de novo repeat families in newly sequenced genomes. Repeat libraries for C. briggsae, M. muscles (X chromosome), R. novegicus (X chromosome), armadillo, H. sapiens (X chromosome), and various other mammals created using RepeatScout are available on the main site.

View all literature mentions

EVidenceModeler (tool)

RRID:SCR_014659

Software tool for automated eukaryotic gene structure annotation that reports eukaryotic gene structures as weighted consensus of all available evidence. Used to combine ab intio gene predictions and protein and transcript alignments into weighted consensus gene structures. Inputs include genome sequence, gene predictions, and alignment data (in GFF3 format).

View all literature mentions

Pilon (tool)

RRID:SCR_014731

Software tool to automatically improve draft assemblies and find variation among strains, including large event detection. FASTA files of genome along with one or more BAM files of reads aligned as input. Read alignment analysis is used to identify inconsistencies between input genome and evidence in reads, then attempts to make improvements to genome.

View all literature mentions

BUSCO (tool)

RRID:SCR_015008

Software tool to quantitatively measure genome assembly and annotation completeness based on evolutionarily informed expectations of gene content.

View all literature mentions

RepeatModeler (tool)

RRID:SCR_015027

Sequence analysis software that performs repeat family identification and creates models for sequence data. RepeatModeler utilizes RepeatScout and RECON to identify repeat element boundaries and family relationships.

View all literature mentions

GeneWise (tool)

RRID:SCR_015054

Gene alignment tool from the EBI which predicts gene structure using similar protein sequences. See also the associated GenomeWise tool.

View all literature mentions

Canu (tool)

RRID:SCR_015880

Software for scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Canu is a fork of the Celera Assembler and is designed for high-noise single-molecule sequencing (such as the PacBio RS II/Sequel or Oxford Nanopore MinION).

View all literature mentions

RNAmmer (tool)

RRID:SCR_017075

Software package to predict ribosomal RNA genes in full genome sequences by utilising two levels of Hidden Markov Models. Consistent and rapid annotation of ribosomal RNA genes.

View all literature mentions

LTR_retriever (tool)

RRID:SCR_017623

Software package for identification of long terminal repeat retrotransposons (LTR-RTs). Removes false positives from initial software predictions. Achieves very high specificity, accuracy, and precision without significantly sacrificing sensitivity, hence significantly outperforming existing methods. Can construct LTR libraries directly from self-corrected PacBio reads prior to genome assembly.

View all literature mentions

MCScan (tool)

RRID:SCR_017650

Software package to simultaneously scan multiple genomes to identify homologous chromosomal regions and subsequently align these regions using genes as anchors.Used to identify conserved gene arrays both within same genome and across different genomes. Command line program to wrap dagchainer and combine pairwise results into multi alignments in column format.

View all literature mentions

STRUCTURE (tool)

RRID:SCR_021634

Software package for using multi locus genotype data to investigate population structure. Used for inferring presence of distinct populations, assigning individuals to populations, studying hybrid zones, identifying migrants and admixed individuals, and estimating population allele frequencies in situations where many individuals are migrants or admixed. Can be applied to most of commonly used genetic markers, including SNPS, microsatellites, RFLPs and Amplified Fragment Length Polymorphisms.

View all literature mentions

tRNAscan-SE (tool)

RRID:SCR_008637

Web server to search for tRNA genes in genomic sequence. If you would like to run tRNAscan-SE locally, you can get the UNIX source code (gzip''d tar file).

View all literature mentions

STRUCTURE (tool)

RRID:SCR_017637

Software package for using multi locus genotype data to investigate population structure. Used for inferring presence of distinct populations, assigning individuals to populations, studying hybrid zones, identifying migrants and admixed individuals, and estimating population allele frequencies in situations where many individuals are migrants or admixed. Can be applied to most of commonly used genetic markers, including SNPS, microsatellites, RFLPs and Amplified Fragment Length Polymorphisms.

View all literature mentions