Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Chromosome-level genome assembly of the aphid parasitoid Aphidius gifuensis using Oxford Nanopore sequencing and Hi-C technology.

Molecular ecology resources | 2021

Aphidius gifuensis is a parasitoid wasp that has been commercially bred and released in large scale as a biocontrol agent for the management of aphid pests. As a highly efficient endoparasitoid, it is also an important model for exploring mechanisms of parasitism. Currently, artificially bred populations of this wasp are facing rapid decline with undetermined cause, and mechanisms underlying its parasitoid strategy remain poorly understood. Exploring the mechanism behind its population decline and the host-parasitoid relationship is impeded partly due to the lack of a comprehensive genome data for this species. In this study, we constructed a high-quality reference genome of A. gifuensis using Oxford Nanopore sequencing and Hi-C (proximity ligation chromatin conformation capture) technology. The final genomic assembly was 156.9 Mb, with a contig N50 length of 3.93 Mb, the longest contig length of 10.4 Mb and 28.89% repetitive sequences. 99.8% of genome sequences were anchored onto six linkage groups. A total of 11,535 genes were predicted, of which 90.53% were functionally annotated. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis showed the completeness of assembled genome is 98.3%. We found significantly expanded gene families involved in metabolic processes, transmembrane transport, cell signal communication and oxidoreductase activity, in particular ATP-binding cassette (ABC) transporter, Cytochrome P450 and venom proteins. The olfactory receptors (ORs) showed significant contraction, which may be associated with the decrease in host recognition. Our study provides a solid foundation for future studies on the molecular mechanisms of population decline as well as host-parasitoid relationship for parasitoid wasps.

Pubmed ID: 33314728 RIS Download

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


BLASTN (tool)

RRID:SCR_001598

Web application to search nucleotide databases using a nucleotide query. Algorithms: blastn, megablast, discontiguous megablast.

View all literature mentions

NCBI BLAST (tool)

RRID:SCR_004870

Web search tool to find regions of similarity between biological sequences. Program compares nucleotide or protein sequences to sequence databases and calculates statistical significance. Used for identifying homologous sequences.

View all literature mentions

Hmmer (tool)

RRID:SCR_005305

Tool for searching sequence databases for homologs of protein sequences, and for making protein sequence alignments. It implements methods using probabilistic models called profile hidden Markov models (profile HMMs). Compared to BLAST, FASTA, and other sequence alignment and database search tools based on older scoring methodology, HMMER aims to be significantly more accurate and more able to detect remote homologs because of the strength of its underlying mathematical models. In the past, this strength came at significant computational expense, but in the new HMMER3 project, HMMER is now essentially as fast as BLAST.

View all literature mentions

Rfam (tool)

RRID:SCR_007891

The Rfam database is a collection of RNA families, each represented by multiple sequence alignments, consensus secondary structures and covariance models (CMs). The families in Rfam break down into three broad functional classes: Non-coding RNA genes, structured cis-regulatory elements and self-splicing RNAs. Typically these functional RNAs often have a conserved secondary structure which may be better preserved than the RNA sequence. The CMs used to describe each family are a slightly more complicated relative of the profile hidden Markov models (HMMs) used by Pfam. CMs can simultaneously model RNA sequence and the structure in an elegant and accurate fashion. Rfam is also available via FTP. You can find data in Rfam in various ways... * Analyze your RNA sequence for Rfam matches * View Rfam family annotation and alignments * View Rfam clan details * Query Rfam by keywords * Fetch families or sequences by NCBI taxonomy * Enter any type of accession or ID to jump to the page for a Rfam family, sequence or genome

View all literature mentions

Augustus (tool)

RRID:SCR_008417

Software for gene prediction in eukaryotic genomic sequences. Serves as a basis for further steps in the analysis of sequenced and assembled eukaryotic genomes.

View all literature mentions

MAFFT (tool)

RRID:SCR_011811

Software package as multiple alignment program for amino acid or nucleotide sequences. Can align up to 500 sequences or maximum file size of 1 MB. First version of MAFFT used algorithm based on progressive alignment, in which sequences were clustered with help of Fast Fourier Transform. Subsequent versions have added other algorithms and modes of operation, including options for faster alignment of large numbers of sequences, higher accuracy alignments, alignment of non-coding RNA sequences, and addition of new sequences to existing alignments.

View all literature mentions

KEGG (tool)

RRID:SCR_012773

Integrated database resource consisting of 16 main databases, broadly categorized into systems information, genomic information, and chemical information. In particular, gene catalogs in completely sequenced genomes are linked to higher-level systemic functions of cell, organism, and ecosystem. Analysis tools are also available. KEGG may be used as reference knowledge base for biological interpretation of large-scale datasets generated by sequencing and other high-throughput experimental technologies.

View all literature mentions

EVidenceModeler (tool)

RRID:SCR_014659

Software tool for automated eukaryotic gene structure annotation that reports eukaryotic gene structures as weighted consensus of all available evidence. Used to combine ab intio gene predictions and protein and transcript alignments into weighted consensus gene structures. Inputs include genome sequence, gene predictions, and alignment data (in GFF3 format).

View all literature mentions

RepeatModeler (tool)

RRID:SCR_015027

Sequence analysis software that performs repeat family identification and creates models for sequence data. RepeatModeler utilizes RepeatScout and RECON to identify repeat element boundaries and family relationships.

View all literature mentions

LTR_Finder (tool)

RRID:SCR_015247

Web software capable of scanning large-scale sequences for full-length LTR retrotranspsons.

View all literature mentions

SMARTdenovo (tool)

RRID:SCR_017622

Software tool as de novo assembler for PacBio and Oxford Nanopore data. It produces assembly from all-vs-all raw read alignments without error correction stage. Allows to read overlapping, rescue missing overlaps, identify low-quality regions and chimaera and produce better consensus.

View all literature mentions

LACHESIS (tool)

RRID:SCR_017644

Software tool for chromosome scale scaffolding of de novo genome assemblies based on chromatin interactions.Method exploits signal of genomic proximity in Hi-C datasets for ultra long range scaffolding of de novo genome assemblies.

View all literature mentions