Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Reference-guided de novo assembly approach improves genome reconstruction for related species.

BMC bioinformatics | 2017

The development of next-generation sequencing has made it possible to sequence whole genomes at a relatively low cost. However, de novo genome assemblies remain challenging due to short read length, missing data, repetitive regions, polymorphisms and sequencing errors. As more and more genomes are sequenced, reference-guided assembly approaches can be used to assist the assembly process. However, previous methods mostly focused on the assembly of other genotypes within the same species. We adapted and extended a reference-guided de novo assembly approach, which enables the usage of a related reference sequence to guide the genome assembly. In order to compare and evaluate de novo and our reference-guided de novo assembly approaches, we used a simulated data set of a repetitive and heterozygotic plant genome.

Pubmed ID: 29126390 RIS Download

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


GATK (tool)

RRID:SCR_001876

A software package to analyze next-generation resequencing data. The toolkit offers a wide variety of tools, with a primary focus on variant discovery and genotyping as well as strong emphasis on data quality assurance. Its robust architecture, powerful processing engine and high-performance computing features make it capable of taking on projects of any size. This software library makes writing efficient analysis tools using next-generation sequencing data very easy, and second it's a suite of tools for working with human medical resequencing projects such as 1000 Genomes and The Cancer Genome Atlas. These tools include things like a depth of coverage analyzers, a quality score recalibrator, a SNP/indel caller and a local realigner. (entry from Genetic Analysis Software)

View all literature mentions

SAMTOOLS (tool)

RRID:SCR_002105

Original SAMTOOLS package has been split into three separate repositories including Samtools, BCFtools and HTSlib. Samtools for manipulating next generation sequencing data used for reading, writing, editing, indexing,viewing nucleotide alignments in SAM,BAM,CRAM format. BCFtools used for reading, writing BCF2,VCF, gVCF files and calling, filtering, summarising SNP and short indel sequence variants. HTSlib used for reading, writing high throughput sequencing data.

View all literature mentions

Picard (tool)

RRID:SCR_006525

Java toolset for working with next generation sequencing data in the BAM format.

View all literature mentions

ABySS (tool)

RRID:SCR_010709

Software providing de novo, parallel, paired-end sequence assembler that is designed for short reads. ABySS 1.0 originally showed that assembling human genome using short 50 bp sequencing reads was possible by aggregating half terabyte of compute memory needed over several computers using standardized message passing system. ABySS 2.0 is Resource Efficient Assembly of Large Genomes using Bloom Filter. ABySS 2.0 departs from MPI and instead implements algorithms that employ Bloom filter, probabilistic data structure, to represent de Bruijn graph and reduce memory requirements.

View all literature mentions

ALLPATHS-LG (tool)

RRID:SCR_010742

Software tool as whole genome shotgun assembler that can generate high quality genome assemblies using short reads (~100bp) such as those produced by the new generation of sequencers.

View all literature mentions

Trimmomatic (tool)

RRID:SCR_011848

Software Java pipeline for trimming tasks for Illumina paired end and single ended data. Flexible Trimmer for Illumina Sequence Data. Pair aware preprocessing tool optimized for Illumina next generation sequencing data. Includes several processing steps for read trimming and filtering. Operating systems Unix/Linux, Mac OS, Windows.

View all literature mentions

IDBA-UD (tool)

RRID:SCR_011912

Software for an iterative De Bruijn Graph De Novo Assembler for Short Reads Sequencing data with Highly Uneven Sequencing Depth.

View all literature mentions

FastQC (tool)

RRID:SCR_014583

Quality control software that perform checks on raw sequence data coming from high throughput sequencing pipelines. This software also provides a modular set of analyses which can give a quick impression of the quality of the data prior to further analysis.

View all literature mentions

SOAPdenovo (tool)

RRID:SCR_014986

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on February 24,2023. Software tool for de novo assembly of human genomes with massively parallel short read sequencing.Short-read assembly method that can build de novo draft assembly for human sized genomes.Software package for assembling short oligonucleotide into contigs and scaffolds.

View all literature mentions

CEGMA (tool)

RRID:SCR_015055

THIS RESOURCE IS NO LONGER IN SERVICE, documented on January 19, 2022. Tool to annotate core genes in eukaryotic genomes (that was replaced by BUSCO). Its resulting core gene dataset can be used to train a gene finder or to assess the completeness of the genome or annotations.

View all literature mentions

Bamtools (tool)

RRID:SCR_015987

Software that provides both a C++ API and a command-line toolkit for reading, writing, and manipulating genome sequence alignment files in the BAM and SAM formats. It is used for research analysis and management of data produced by sequencing technologies.

View all literature mentions

SOAPdenovo (tool)

RRID:SCR_010752

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on February 24,2023. Software tool for de novo assembly of human genomes with massively parallel short read sequencing.Short-read assembly method that can build de novo draft assembly for human sized genomes.Software package for assembling short oligonucleotide into contigs and scaffolds.

View all literature mentions