Jellyfish belong to the phylum Cnidaria, which occupies an important phylogenetic location in the early-branching Metazoa lineages. The jellyfish Rhopilema esculentum is an important fishery resource in China. However, the genome resource of R. esculentum has not been reported to date.
Pubmed ID: 32315029 RIS Download
Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.
OrthoMCL is a genome-scale algorithm for grouping orthologous protein sequences. It provides not only groups shared by two or more species/genomes, but also groups representing species-specific gene expansion families. OrthoMCL starts with reciprocal best hits within each genome as putative in-paralog/recent paralog pairs and reciprocal best hits across any two genomes as putative ortholog pairs. Related proteins are interlinked in a similarity graph. Then MCL (Markov Clustering algorithm,Van Dongen 2000; www.micans.org/mcl) is invoked to split mega-clusters. This process is analogous to the manual review in COG construction. MCL clustering is based on weights between each pair of proteins, so to correct for differences in evolutionary distance the weights are normalized before running MCL.
View all literature mentionsMultiple sequence alignment method with reduced time and space complexity.Multiple sequence alignment with high accuracy and high throughput. Data analysis service for multiple sequence comparison by log- expectation.
View all literature mentionsSoftware tool to quantitatively measure genome assembly and annotation completeness based on evolutionarily informed expectations of gene content.
View all literature mentionsSoftware for aligning sequencing reads against large reference genome. Consists of three algorithms: BWA-backtrack, BWA-SW and BWA-MEM. First for sequence reads up to 100bp, and other two for longer sequences ranged from 70bp to 1Mbp.
View all literature mentionsSoftware tool for transcriptome assembly and differential expression analysis for RNA-Seq. Includes script called cuffmerge that can be used to merge together several Cufflinks assemblies. It also handles running Cuffcompare as well as automatically filtering a number of transfrags that are likely to be artifacts. If the researcher has a reference GTF file, the researcher can provide it to the script to more effectively merge novel isoforms and maximize overall assembly quality.
View all literature mentionsSoftware tool for fast and high throughput alignment of shotgun cDNA sequencing reads generated by transcriptomics technologies. Fast splice junction mapper for RNA-Seq reads. Aligns RNA-Seq reads to mammalian-sized genomes using ultra high-throughput short read aligner Bowtie, and then analyzes mapping results to identify splice junctions between exons.TopHat2 is accurate alignment of transcriptomes in presence of insertions, deletions and gene fusions.
View all literature mentionsA gene finder based on a Generalized Hidden Markov Model (GHMM). Although the gene finder conforms to the overall mathematical framework of a GHMM, additionally it incorporates splice site models adapted from the GeneSplicer program and a decision tree adapted from GlimmerM. It also utilizes Interpolated Markov Models for the coding and noncoding models . Currently, GlimmerHMM's GHMM structure includes introns of each phase, intergenic regions, and four types of exons (initial, internal, final, and single).
View all literature mentionsGene alignment tool from the EBI which predicts gene structure using similar protein sequences. See also the associated GenomeWise tool.
View all literature mentionsSoftware for gene prediction in eukaryotic genomic sequences. Serves as a basis for further steps in the analysis of sequenced and assembled eukaryotic genomes.
View all literature mentionsSoftware tool that screens DNA sequences for interspersed repeats and low complexity DNA sequences. The output of the program is a detailed annotation of the repeats that are present in the query sequence as well as a modified version of the query sequence in which all the annotated repeats have been masked (default: replaced by Ns). Currently over 56% of human genomic sequence is identified and masked by the program. Sequence comparisons in RepeatMasker are performed by one of several popular search engines including nhmmer, cross_match, ABBlast/WUBlast, RMBlast and Decypher. RepeatMasker makes use of curated libraries of repeats and currently supports Dfam ( profile HMM library ) and RepBase ( consensus sequence library ).
View all literature mentionsSequence analysis software that performs repeat family identification and creates models for sequence data. RepeatModeler utilizes RepeatScout and RECON to identify repeat element boundaries and family relationships.
View all literature mentionsQuality control software that perform checks on raw sequence data coming from high throughput sequencing pipelines. This software also provides a modular set of analyses which can give a quick impression of the quality of the data prior to further analysis.
View all literature mentionsSoftware program for phylogenetic analyses of large datasets under maximum likelihood.
View all literature mentions