Assessment of de novo assemblers for draft genomes: a case study with fungal genomes.

BMC genomics | 2014

Recently, large bio-projects dealing with the release of different genomes have transpired. Most of these projects use next-generation sequencing platforms. As a consequence, many de novo assembly tools have evolved to assemble the reads generated by these platforms. Each tool has its own inherent advantages and disadvantages, which make the selection of an appropriate tool a challenging task.

PubMed ID: 25521762

Associated grants

None

Publication data is provided by the National Library of Medicine® and PubMed®. Data is retrieved from PubMed® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


SPAdes (tool)

RRID:SCR_000131

Software package for assembling single-cell genomes and mini-metagenomes from short-read sets. For uncultivatable bacteria, it recovers genome content that vastly exceeds what may be obtained via traditional metagenomics studies. Works with Illumina or IonTorrent reads and can produce hybrid assemblies using PacBio, Oxford Nanopore and Sanger reads. Intended for small genomes, such as bacterial or fungal genomes.


SparseAssembler (tool)

RRID:SCR_001100

Software for memory-efficient genome assembly. It uses sparse k-mers.


DNA DataBank of Japan (DDBJ) (tool)

RRID:SCR_002359

Maintains and provides archival, retrieval and analytical resources for biological information. The central DDBJ resource consists of public, open-access nucleotide sequence databases, including raw sequence reads, assembly information and functional annotation. Database content is exchanged with EBI and NCBI within the framework of the International Nucleotide Sequence Database Collaboration (INSDC). In 2011, DDBJ launched two new resources: the DDBJ Omics Archive (DOR) and BioProject. DOR is an archival database of functional genomics data generated by microarrays and highly parallel new-generation sequencers. Data are exchanged between ArrayExpress at EBI and DOR in the common MAGE-TAB format. BioProject provides an organizational framework for accessing metadata about research projects and the data from those projects that are deposited into different databases.


NCBI BioProject (tool)

RRID:SCR_004801

Database of biological data related to a single initiative, originating from a single organization or from a consortium. A BioProject record provides users a single place to find links to the diverse data types generated for that project. It is a searchable collection of complete and incomplete (in-progress) large-scale sequencing, assembly, annotation, and mapping projects for cellular organisms. Submissions are supported by a web-based Submission Portal. The database facilitates organization and classification of project data submitted to NCBI, EBI and DDBJ databases. It captures descriptive information about research projects that result in high-volume submissions to archival databases, ties together related data across multiple archives and serves as a central portal to inform users of data availability. BioProject records link to corresponding data stored in archival repositories. The BioProject resource is a redesigned, expanded replacement of the NCBI Genome Project resource. The redesign adds tracking of several data elements, including more precise information about a project's scope, material, and objectives. Genome Project identifiers are retained in BioProject as the ID value for a record, and an accession number has been added. Database content is exchanged with other members of the International Nucleotide Sequence Database Collaboration (INSDC). BioProject is accessible via FTP.


Minia (tool)

RRID:SCR_004986

A short-read assembler based on a de Bruijn graph, capable of assembling a human genome on a desktop computer in a day.
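As an illustration of the de Bruijn graph idea mentioned above, here is a minimal Python sketch. It is not Minia's implementation; the function names `build_de_bruijn` and `walk_unbranched`, the toy reads and the choice of k = 4 are all assumptions made for the example. Nodes are (k-1)-mers, and each k-mer in a read contributes one edge.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Map each (k-1)-mer node to the set of (k-1)-mer nodes it points to."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            # Each k-mer is an edge from its prefix to its suffix.
            graph[kmer[:-1]].add(kmer[1:])
    return graph


def walk_unbranched(graph, start):
    """Extend a contig from start while the current node has exactly one successor.

    Simplification: only out-degree is checked, and the walk stops at a revisited node.
    """
    contig = start
    node = start
    seen = {start}
    while len(graph.get(node, ())) == 1:
        (nxt,) = graph[node]
        if nxt in seen:
            break
        contig += nxt[-1]
        seen.add(nxt)
        node = nxt
    return contig


# Hypothetical overlapping reads, chosen only for illustration.
reads = ["ATGGCG", "GGCGTG", "CGTGCA"]
graph = build_de_bruijn(reads, k=4)
contig = walk_unbranched(graph, "ATG")
print(contig)  # ATGGCGTGCA
```

Real assemblers additionally handle reverse complements, sequencing errors and branching paths, none of which this sketch attempts.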


ABySS (tool)

RRID:SCR_010709

A de novo, parallel, paired-end sequence assembler designed for short reads. ABySS 1.0 originally showed that assembling a human genome from short 50 bp sequencing reads was possible by aggregating the half terabyte of compute memory needed across several computers using a standardized message-passing system (MPI). ABySS 2.0 targets resource-efficient assembly of large genomes using a Bloom filter. It departs from MPI and instead implements algorithms that employ a Bloom filter, a probabilistic data structure, to represent the de Bruijn graph and reduce memory requirements.
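To make the Bloom filter idea concrete, the following Python sketch stores k-mers in a bit array and recovers graph edges implicitly by querying the four possible successor k-mers. It is an illustration only, not ABySS code: the class `KmerBloomFilter`, the helper `successors`, the SHA-256-based hashing and the sizes used are all assumptions. A Bloom filter never reports a stored k-mer as absent, but it can report an unseen k-mer as present, which is the price of the memory savings.

```python
import hashlib


class KmerBloomFilter:
    """Fixed-size bit array that answers 'possibly present' or 'definitely absent'."""

    def __init__(self, num_bits, num_hashes):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, kmer):
        # Derive num_hashes bit positions by salting one hash function with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{kmer}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, kmer):
        for pos in self._positions(kmer):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, kmer):
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(kmer)
        )


def successors(bloom, kmer):
    """Query the four possible next k-mers; those reported present are implied edges."""
    return [kmer[1:] + base for base in "ACGT" if kmer[1:] + base in bloom]


# Hypothetical reads and parameters, chosen only for illustration.
k = 4
bloom = KmerBloomFilter(num_bits=1 << 16, num_hashes=3)
for read in ["ATGGCG", "GGCGTG"]:
    for i in range(len(read) - k + 1):
        bloom.add(read[i:i + k])

print("ATGG" in bloom)  # True: stored k-mers are never reported absent
```

Because edges are never stored explicitly, memory use depends on the bit-array size rather than on the number of edges; a production assembler would also need a strategy for the false positives.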


SOAPdenovo (tool)

RRID:SCR_010752

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on February 24, 2023. Software tool for de novo assembly of human genomes with massively parallel short-read sequencing. Short-read assembly method that can build a de novo draft assembly for human-sized genomes. Software package for assembling short oligonucleotides into contigs and scaffolds.


IDBA-UD (tool)

RRID:SCR_011912

An iterative de Bruijn graph de novo assembler for short-read sequencing data with highly uneven sequencing depth.


CEGMA (tool)

RRID:SCR_015055

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 19, 2022. Tool to annotate core genes in eukaryotic genomes; it has been replaced by BUSCO. Its resulting core gene dataset can be used to train a gene finder or to assess the completeness of a genome or its annotations.
