PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls.

Nature Biotechnology | Jan 9, 2009

Chromatin immunoprecipitation (ChIP) followed by tag sequencing (ChIP-seq) using high-throughput next-generation instrumentation is fast replacing chromatin immunoprecipitation followed by genome tiling array analysis (ChIP-chip) as the preferred approach for mapping sites of transcription-factor binding and chromatin modification. Using two deeply sequenced data sets for human RNA polymerase II and STAT1, each with matching input-DNA controls, we describe a general scoring approach to address unique challenges in ChIP-seq data analysis. Our approach is based on the observation that sites of potential binding are strongly correlated with signal peaks in the control, likely revealing features of open chromatin. We develop a two-pass strategy called PeakSeq to compensate for this signal, which is caused by open chromatin and revealed by inclusion of the controls. The first pass identifies putative binding sites and compensates for genomic variation in the 'mappability' of sequences. The second pass filters out sites not significantly enriched compared to the normalized control, computing precise enrichments and significances. Our scoring procedure enables us to optimize experimental design by estimating the depth of sequencing required for a desired level of coverage and by demonstrating that more than two replicates provide only a marginal gain in information.
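The second pass described above (normalize the control, then keep only sites significantly enriched over it) can be illustrated with a minimal sketch. The abstract does not specify the statistics, so everything here is an assumption made for illustration: the through-origin least-squares scaling, the one-sided binomial test, the Benjamini-Hochberg correction, and all function names are hypothetical and are not taken from the PeakSeq software.

```python
from math import comb

def scaling_factor(sample_counts, control_counts):
    """Assumed normalization: least-squares slope through the origin of
    sample tag counts against control tag counts in matched windows."""
    num = sum(s * c for s, c in zip(sample_counts, control_counts))
    den = sum(c * c for c in control_counts)
    return num / den if den else 1.0

def binomial_tail(k, n, p=0.5):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k, n + 1))

def benjamini_hochberg(pvalues):
    """Return BH-adjusted q-values in the same order as the input."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    qvalues = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so q-values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        qvalues[i] = running_min
    return qvalues

def score_candidates(candidates, alpha):
    """candidates: list of (name, sample_tags, control_tags) from pass one.
    alpha: control-to-sample scaling factor.
    Returns a list of (name, enrichment, q_value)."""
    pvals, enrichments = [], []
    for _, sample, control in candidates:
        scaled = int(round(alpha * control))
        # Null hypothesis: a tag is equally likely to come from the
        # sample or from the scaled control.
        pvals.append(binomial_tail(sample, sample + scaled))
        enrichments.append(sample / max(scaled, 1))
    qvals = benjamini_hochberg(pvals)
    return [(c[0], e, q) for c, e, q in zip(candidates, enrichments, qvals)]

if __name__ == "__main__":
    alpha = scaling_factor([10, 20, 30], [5, 10, 15])
    for name, enrichment, q in score_candidates([("a", 40, 5), ("b", 12, 6)], alpha):
        print(name, enrichment, q)
```

In this toy input, region "a" has four times as many sample tags as scaled control tags and would pass a typical q-value threshold, while region "b" matches the scaled control and would be filtered out.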

PubMed ID: 19122651

MeSH terms: Binding Sites | Biotechnology | Chromatin | Chromatin Immunoprecipitation | DNA | False Positive Reactions | Genetic Variation | Genome | Genomics | Humans | Models, Genetic | Oligonucleotide Array Sequence Analysis | RNA Polymerase II | Sequence Analysis, DNA | Software

Associated grants

  • Agency: NCRR NIH HHS, Id: RR19895
  • Agency: NHGRI NIH HHS, Id: U54 HG004558
  • Agency: NHGRI NIH HHS, Id: U54 HG004558-02

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.