Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

MetaCRAM: an integrated pipeline for metagenomic taxonomy identification and compression.

BMC bioinformatics | 2016

Metagenomics is a genomics research discipline devoted to the study of microbial communities in environmental samples and human and animal organs and tissues. Sequenced metagenomic samples usually comprise reads from a large number of different bacterial communities and hence tend to result in large file sizes, typically ranging between 1-10 GB. This leads to challenges in analyzing, transferring and storing metagenomic data. In order to overcome these data processing issues, we introduce MetaCRAM, the first de novo, parallelized software suite specialized for FASTA and FASTQ format metagenomic read processing and lossless compression.

Pubmed ID: 26895947 RIS Download

Research resources used in this publication

None found

Additional research tools detected in this publication

Antibodies used in this publication

None found

Associated grants

  • Agency: NCI NIH HHS, United States
    Id: U01 CA198943
  • Agency: NCI NIH HHS, United States
    Id: U01 CA198943-02

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


MetaPhyler (tool)

RRID:SCR_004848

A taxonomic classifier for metagenomic shotgun reads, which uses phylogenetic marker genes as a taxonomic reference. The classifier, based on BLAST, uses different thresholds (automatically learned from the reference database) for each combination of taxonomic rank, reference gene, and sequence length. The reference database includes marker genes from all complete genomes, several draft genomes and the NCBI nr protein database.

View all literature mentions

Kraken (tool)

RRID:SCR_005484

A set of software tools ( Reaper, Tally and Sequence Imp) designed to streamline the analysis of next-generation sequencing data. Although designed with small RNA sequence analysis in mind the tools can be used to address issues facing next-generation sequencing in general.

View all literature mentions

MFCompress (tool)

RRID:SCR_009301

A compression tool for FASTA and multi-FASTA files.

View all literature mentions