Aphidius gifuensis is a parasitoid wasp that has been commercially bred and released in large scale as a biocontrol agent for the management of aphid pests. As a highly efficient endoparasitoid, it is also an important model for exploring mechanisms of parasitism. Currently, artificially bred populations of this wasp are facing rapid decline with undetermined cause, and mechanisms underlying its parasitoid strategy remain poorly understood. Exploring the mechanism behind its population decline and the host-parasitoid relationship is impeded partly due to the lack of a comprehensive genome data for this species. In this study, we constructed a high-quality reference genome of A. gifuensis using Oxford Nanopore sequencing and Hi-C (proximity ligation chromatin conformation capture) technology. The final genomic assembly was 156.9 Mb, with a contig N50 length of 3.93 Mb, the longest contig length of 10.4 Mb and 28.89% repetitive sequences. 99.8% of genome sequences were anchored onto six linkage groups. A total of 11,535 genes were predicted, of which 90.53% were functionally annotated. Benchmarking Universal Single-Copy Orthologs (BUSCO) analysis showed the completeness of assembled genome is 98.3%. We found significantly expanded gene families involved in metabolic processes, transmembrane transport, cell signal communication and oxidoreductase activity, in particular ATP-binding cassette (ABC) transporter, Cytochrome P450 and venom proteins. The olfactory receptors (ORs) showed significant contraction, which may be associated with the decrease in host recognition. Our study provides a solid foundation for future studies on the molecular mechanisms of population decline as well as host-parasitoid relationship for parasitoid wasps.
Pubmed ID: 33314728 RIS Download
Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.
Web application to search nucleotide databases using a nucleotide query. Algorithms: blastn, megablast, discontiguous megablast.
View all literature mentionsWeb search tool to find regions of similarity between biological sequences. Program compares nucleotide or protein sequences to sequence databases and calculates statistical significance. Used for identifying homologous sequences.
View all literature mentionsTool for searching sequence databases for homologs of protein sequences, and for making protein sequence alignments. It implements methods using probabilistic models called profile hidden Markov models (profile HMMs). Compared to BLAST, FASTA, and other sequence alignment and database search tools based on older scoring methodology, HMMER aims to be significantly more accurate and more able to detect remote homologs because of the strength of its underlying mathematical models. In the past, this strength came at significant computational expense, but in the new HMMER3 project, HMMER is now essentially as fast as BLAST.
View all literature mentionsThe Rfam database is a collection of RNA families, each represented by multiple sequence alignments, consensus secondary structures and covariance models (CMs). The families in Rfam break down into three broad functional classes: Non-coding RNA genes, structured cis-regulatory elements and self-splicing RNAs. Typically these functional RNAs often have a conserved secondary structure which may be better preserved than the RNA sequence. The CMs used to describe each family are a slightly more complicated relative of the profile hidden Markov models (HMMs) used by Pfam. CMs can simultaneously model RNA sequence and the structure in an elegant and accurate fashion. Rfam is also available via FTP. You can find data in Rfam in various ways... * Analyze your RNA sequence for Rfam matches * View Rfam family annotation and alignments * View Rfam clan details * Query Rfam by keywords * Fetch families or sequences by NCBI taxonomy * Enter any type of accession or ID to jump to the page for a Rfam family, sequence or genome
View all literature mentionsSoftware for gene prediction in eukaryotic genomic sequences. Serves as a basis for further steps in the analysis of sequenced and assembled eukaryotic genomes.
View all literature mentionsSoftware package as multiple alignment program for amino acid or nucleotide sequences. Can align up to 500 sequences or maximum file size of 1 MB. First version of MAFFT used algorithm based on progressive alignment, in which sequences were clustered with help of Fast Fourier Transform. Subsequent versions have added other algorithms and modes of operation, including options for faster alignment of large numbers of sequences, higher accuracy alignments, alignment of non-coding RNA sequences, and addition of new sequences to existing alignments.
View all literature mentionsIntegrated database resource consisting of 16 main databases, broadly categorized into systems information, genomic information, and chemical information. In particular, gene catalogs in completely sequenced genomes are linked to higher-level systemic functions of cell, organism, and ecosystem. Analysis tools are also available. KEGG may be used as reference knowledge base for biological interpretation of large-scale datasets generated by sequencing and other high-throughput experimental technologies.
View all literature mentionsSoftware tool for automated eukaryotic gene structure annotation that reports eukaryotic gene structures as weighted consensus of all available evidence. Used to combine ab intio gene predictions and protein and transcript alignments into weighted consensus gene structures. Inputs include genome sequence, gene predictions, and alignment data (in GFF3 format).
View all literature mentionsSequence analysis software that performs repeat family identification and creates models for sequence data. RepeatModeler utilizes RepeatScout and RECON to identify repeat element boundaries and family relationships.
View all literature mentionsWeb software capable of scanning large-scale sequences for full-length LTR retrotranspsons.
View all literature mentionsSoftware tool as de novo assembler for PacBio and Oxford Nanopore data. It produces assembly from all-vs-all raw read alignments without error correction stage. Allows to read overlapping, rescue missing overlaps, identify low-quality regions and chimaera and produce better consensus.
View all literature mentionsSoftware tool for chromosome scale scaffolding of de novo genome assemblies based on chromatin interactions.Method exploits signal of genomic proximity in Hi-C datasets for ultra long range scaffolding of de novo genome assemblies.
View all literature mentions