Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Publication

Draft genomes of two blister beetles Hycleus cichorii and Hycleus phaleratus.

Yuan-Ming Wu | Jiang Li | Xiang-Sheng Chen

GigaScience | 2018

Commonly known as blister beetles or Spanish fly, there are more than 1500 species in the Meloidae family (Hexapoda: Coleoptera: Tenebrionoidea) that produce the potent defensive blistering agent cantharidin. Cantharidin and its derivatives have been used to treat cancers such as liver, stomach, lung, and esophageal cancers. Hycleus cichorii and Hycleus phaleratus are the most commercially important blister beetles in China due to their ability to biosynthesize this potent vesicant. However, there is a lack of genome reference, which has hindered development of studies on the biosynthesis of cantharidin and a better understanding of its biology and pharmacology.

Pubmed ID: 29444297 RIS Download

Research resources used in this publication

Additional research tools detected in this publication

SOAPdenovo (RRID:SCR_010752)

Antibodies used in this publication

None found

Associated grants

None

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.

SOAPdenovo (tool)

RRID:SCR_010752

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on February 24,2023. Software tool for de novo assembly of human genomes with massively parallel short read sequencing.Short-read assembly method that can build de novo draft assembly for human sized genomes.Software package for assembling short oligonucleotide into contigs and scaffolds.

View all literature mentions

BUSCO (software resource)

RRID:SCR_015008

Software tool to quantitatively measure genome assembly and annotation completeness based on evolutionarily informed expectations of gene content.

View all literature mentions

Peking University; Beijing; China (commercial organization)

RRID:SCR_001193

Chinese research university in Beijing, China that offers undergraduate and graduate degree programs in pure and applied sciences, social sciences and humanities, and sciences of management and education.

View all literature mentions

Honey Bee Genome Project (data or information resource)

RRID:SCR_002890

The HGSC has sequenced the honey bee, Apis mellifera. The version 4.0 assembly was released in March 2006 and published in October 2006. The genome sequence is being upgraded with additional sequence coverage. The honey bee is important in the agricultural community as a producer of honey and as a facilitator of pollination. It is a model organism for studying the following human health issues: immunity, allergic reaction, antibiotic resistance, development, mental health, longevity and diseases of the X chromosome. In addition, biologists are interested in the honey bee's social organization and behavioral traits. This project was proposed to the HGSC by a group of dedicated insect biologists, headed by Gene Robinson. Following a workshop at the HGSC and a honey bee white paper, the HGSC began the project in 2002. A 6-fold coverage WGS, BAC sequence from pooled arrays, and an initial genome assembly (Amel_v1.0) were released beginning in 2003. This has been a challenging project with difficulty in recovering AT-rich regions. The WGS data had lower coverage in AT-rich regions and BAC data from clones showed evidence of internal deletions. Additional reads from AT enriched DNA addressed these underrepresented regions. The current assembly Amel_4.0 was produced with Atlas and includes 2.7 million reads (1.8 Gb) or 7.5x coverage of the (clonable) genome. About 97% of STSs, 98% of ESTs, and 96% of cDNAs are represented in the 231 Mb assembly. About 2,500 reads were also produced from a strain of Africanized honey bee and SNPs were extracted. These were released in dbSNP and the NCBI Trace Archive. Analysis of the genome by a consortium of 20 labs has been completed. This produced a gene list derived from five different methods melded through the GLEAN software. Publications include a main paper in Nature and up to forty companion papers in Genome Research and Insect Molecular Biology. Sponsors: Sequencing of the honey bee is jointly funded by National Human Genome Research Institute (NHGRI) and the Department of Agriculture (USDA). Multiple drones from the same queen (strain DH4) were obtained from Danny Weaver of B. Weaver Apiaries. All libraries were made from DNA isolated from these drones. The honey bee BAC library (CHORI-224) was prepared by Pieter de Jong and Katzutoyo Osoegawa at the Children's Hospital Oakland Research Institute.

View all literature mentions

GENSCAN (service resource)

RRID:SCR_012902

Resource out of service. Documented on February 24,2021.

View all literature mentions

PAML (software resource)

RRID:SCR_014932

Package of programs for phylogenetic analyses of DNA or protein sequences using maximum likelihood. PAML estimates parameters and tests hypotheses to study the evolutionary process from a phylogenetic tree.

View all literature mentions

PhyML (web application)

RRID:SCR_014629

Web phylogeny server based on the maximum-likelihood principle.

View all literature mentions

MAFFT (software resource)

RRID:SCR_011811

Software package as multiple alignment program for amino acid or nucleotide sequences. Can align up to 500 sequences or maximum file size of 1 MB. First version of MAFFT used algorithm based on progressive alignment, in which sequences were clustered with help of Fast Fourier Transform. Subsequent versions have added other algorithms and modes of operation, including options for faster alignment of large numbers of sequences, higher accuracy alignments, alignment of non-coding RNA sequences, and addition of new sequences to existing alignments.

View all literature mentions

SOLAR (software resource)

RRID:SCR_000850

A flexible and extensive software package for genetic variance components analysis, including linkage analysis, quantitative genetic analysis, and covariate screening. Operations are included for calculation of marker-specific or multipoint identity-by-descent (IBD) matrices in pedigrees of arbitrary size and complexity, and for linkage analysis of quantitative traits which may involve multiple loci (oligogenic analysis), dominance effects, and epistasis. (entry from Genetic Analysis Software)

View all literature mentions

Tree families database (data or information resource)

RRID:SCR_013401

A database of phylogenetic trees of animal genes. It aims at developing a curated resource that gives reliable information about ortholog and paralog assignments, and evolutionary history of various gene families. TreeFam defines a gene family as a group of genes that evolved after the speciation of single-metazoan animals. It also tries to include outgroup genes like yeast (S. cerevisiae and S. pombe) and plant (A. thaliana) to reveal these distant members.TreeFam is also an ortholog database. Unlike other pairwise alignment based ones, TreeFam infers orthologs by means of gene trees. It fits a gene tree into the universal species tree and finds historical duplications, speciations and losses events. TreeFam uses this information to evaluate tree building, guide manual curation, and infer complex ortholog and paralog relations.The basic elements of TreeFam are gene families that can be divided into two parts: TreeFam-A and TreeFam-B families. TreeFam-B families are automatically created. They might contain errors given complex phylogenies. TreeFam-A families are manually curated from TreeFam-B ones. Family names and node names are assigned at the same time. The ultimate goal of TreeFam is to present a curated resource for all the families. phylogenetic tree, animal, vertebrate, invertebrate, gene, ortholog, paralog, evolutionary history, gene families, single-metazoan animals, outgroup genes like yeast (S. cerevisiae and S. pombe), plant (A. thaliana), historical duplications, speciations, losses, Human, Genome, comparative genomics

View all literature mentions

Gene Ontology (data or information resource)

RRID:SCR_002811

Computable knowledge regarding functions of genes and gene products. GO resources include biomedical ontologies that cover molecular domains of all life forms as well as extensive compilations of gene product annotations to these ontologies that provide largely species-neutral, comprehensive statements about what gene products do. Used to standardize representation of gene and gene product attributes across species and databases.

View all literature mentions

InterProScan (software resource)

RRID:SCR_005829

Software package for functional analysis of sequences by classifying them into families and predicting presence of domains and sites. Scans sequences against InterPro's signatures. Characterizes nucleotide or protein function by matching it with models from several different databases. Used in large scale analysis of whole proteomes, genomes and metagenomes. Available as Web based version and standalone Perl version and SOAP Web Service.

View all literature mentions

KEGG (software resource)

RRID:SCR_012773

Integrated database resource consisting of 16 main databases, broadly categorized into systems information, genomic information, and chemical information. In particular, gene catalogs in completely sequenced genomes are linked to higher-level systemic functions of cell, organism, and ecosystem. Analysis tools are also available. KEGG may be used as reference knowledge base for biological interpretation of large-scale datasets generated by sequencing and other high-throughput experimental technologies.

View all literature mentions

EVidenceModeler (software resource)

RRID:SCR_014659

Software tool for automated eukaryotic gene structure annotation that reports eukaryotic gene structures as weighted consensus of all available evidence. Used to combine ab intio gene predictions and protein and transcript alignments into weighted consensus gene structures. Inputs include genome sequence, gene predictions, and alignment data (in GFF3 format).

View all literature mentions

UniProt (data or information resource)

RRID:SCR_002380

Collection of data of protein sequence and functional information. Resource for protein sequence and annotation data. Consortium for preservation of the UniProt databases: UniProt Knowledgebase (UniProtKB), UniProt Reference Clusters (UniRef), and UniProt Archive (UniParc), UniProt Proteomes. Collaboration between European Bioinformatics Institute (EMBL-EBI), SIB Swiss Institute of Bioinformatics and Protein Information Resource. Swiss-Prot is a curated subset of UniProtKB.

View all literature mentions

BLASTP (service resource)

RRID:SCR_001010

Data analysis service whose programs search protein databases using a protein query. The algorithms used include blastp, psi-blast, phi-blast, and delta-blast.

View all literature mentions

Augustus (software resource)

RRID:SCR_008417

Software for gene prediction in eukaryotic genomic sequences. Serves as a basis for further steps in the analysis of sequenced and assembled eukaryotic genomes.

View all literature mentions

TopHat (software resource)

RRID:SCR_013035

Software tool for fast and high throughput alignment of shotgun cDNA sequencing reads generated by transcriptomics technologies. Fast splice junction mapper for RNA-Seq reads. Aligns RNA-Seq reads to mammalian-sized genomes using ultra high-throughput short read aligner Bowtie, and then analyzes mapping results to identify splice junctions between exons.TopHat2 is accurate alignment of transcriptomes in presence of insertions, deletions and gene fusions.

View all literature mentions

GeneWise (web application)

RRID:SCR_015054

Gene alignment tool from the EBI which predicts gene structure using similar protein sequences. See also the associated GenomeWise tool.

View all literature mentions

TBLASTN (service resource)

RRID:SCR_011822

Tool to search translated nucleotide databases using a protein query.

View all literature mentions

GapCloser (software resource)

RRID:SCR_015026

Module of SOAPdenovo2 commonly used independently to close gaps in genome assemblies.

View all literature mentions

SSPACE (software resource)

RRID:SCR_005056

A stand-alone software program for scaffolding pre-assembled contigs using paired-read data. Main features are: a short runtime, multiple library input of paired-end and/or mate pair datasets and possible contig extension with unmapped sequence reads.

View all literature mentions

Platanus (software resource)

RRID:SCR_015531

De novo sequence assembler that can reconstruct genomic sequences of highly heterozygous diploids from massively parallel shotgun sequencing data.

View all literature mentions

Jellyfish (software resource)

RRID:SCR_005491

A software tool for fast, memory-efficient counting of k-mers in DNA. A k-mer is a substring of length k, and counting the occurrences of all such substrings is a central step in many analyses of DNA sequence. JELLYFISH can count k-mers quickly by using an efficient encoding of a hash table and by exploiting the compare-and-swap CPU instruction to increase parallelism. Jellyfish is a command-line program that reads FASTA and multi-FASTA files containing DNA sequences. It outputs its k-mer counts in an binary format, which can be translated into a human-readable text format using the jellyfish dump command.

View all literature mentions

ConDeTri (software resource)

RRID:SCR_011838

Software tool as content dependent read trimmer for Illumina data. Content dependent read trimming software for Illumina/Solexa sequencing data.

View all literature mentions

SOAPdenovo (software resource)

RRID:SCR_014986

View all literature mentions

BUSCO (software resource)

RRID:SCR_015008

Software tool to quantitatively measure genome assembly and annotation completeness based on evolutionarily informed expectations of gene content.

View all literature mentions

Peking University; Beijing; China (commercial organization)

RRID:SCR_001193

View all literature mentions

Peking University; Beijing; China (commercial organization)

RRID:SCR_001193

View all literature mentions

BUSCO (software resource)

RRID:SCR_015008

Software tool to quantitatively measure genome assembly and annotation completeness based on evolutionarily informed expectations of gene content.

View all literature mentions

About

The SciCrunch Infrastructure was developed as a cooperative data platform to be used by diverse communities in making data more FAIR.

Contact Us

FAIR Data Informatics Lab

University of California, San Diego

9500 Gilman Drive, Mail Code 0608

La Jolla, CA 92093-0608

United States

info

scicrunch.org

About SciCrunch | Privacy Policy | Terms of Service

Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

Log in

Log in

Publication

Draft genomes of two blister beetles Hycleus cichorii and Hycleus phaleratus.

Research resources used in this publication

Additional research tools detected in this publication

Antibodies used in this publication

Associated grants

This is a list of tools and resources that we have found mentioned in this publication.

RRID:SCR_010752

RRID:SCR_015008

RRID:SCR_001193

RRID:SCR_002890

RRID:SCR_012902

RRID:SCR_014932

RRID:SCR_014629

RRID:SCR_011811

RRID:SCR_000850

RRID:SCR_013401

RRID:SCR_002811

RRID:SCR_005829

RRID:SCR_012773

RRID:SCR_014659

RRID:SCR_002380

RRID:SCR_001010

RRID:SCR_008417

RRID:SCR_013035

RRID:SCR_015054

RRID:SCR_011822

RRID:SCR_015026

RRID:SCR_005056

RRID:SCR_015531

RRID:SCR_005491

RRID:SCR_011838

RRID:SCR_014986

RRID:SCR_015008

RRID:SCR_001193

RRID:SCR_001193

RRID:SCR_015008

About

Recent News Entries

Contact Us

SciCrunch