As studies of DNA methylation increase in scope, it has become evident that methylation has a complex relationship with gene expression, plays an important role in defining cell types, and is disrupted in many diseases. We describe large-scale single-base resolution DNA methylation profiling on a diverse collection of 82 human cell lines and tissues using reduced representation bisulfite sequencing (RRBS). Analysis integrating RNA-seq and ChIP-seq data illuminates the functional role of this dynamic mark. Loci that are hypermethylated across cancer types are enriched for sites bound by NANOG in embryonic stem cells, which supports and expands the model of a stem/progenitor cell signature in cancer. CpGs that are hypomethylated across cancer types are concentrated in megabase-scale domains that occur near the telomeres and centromeres of chromosomes, are depleted of genes, and are enriched for cancer-specific EZH2 binding and H3K27me3 (repressive chromatin). In noncancer samples, there are cell-type specific methylation signatures preserved in primary cell lines and tissues as well as methylation differences induced by cell culture. The relationship between methylation and expression is context-dependent, and we find that CpG-rich enhancers bound by EP300 in the bodies of expressed genes are unmethylated despite the dense gene-body methylation surrounding them. Non-CpG cytosine methylation occurs in human somatic tissue, is particularly prevalent in brain tissue, and is reproducible across many individuals. This study provides an atlas of DNA methylation across diverse and well-characterized samples and enables new discoveries about DNA methylation and its role in gene regulation and disease.
PubMed ID: 23325432
Publication data is provided by the National Library of Medicine® and PubMed®. Data is retrieved from PubMed® on a weekly schedule. For terms and conditions, see the National Library of Medicine Terms and Conditions.
A tool for identifying and visualizing enriched GO terms in ranked lists of genes. It can be run in one of two modes: searching for enriched GO terms that appear densely at the top of a ranked list of genes, or searching for enriched GO terms in a target list of genes compared to a background list of genes.
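As a rough illustration of the target-versus-background mode, enrichment of a single GO term is often scored with a hypergeometric tail probability. The sketch below shows that general technique only; it is not taken from the tool itself, and the function name `hypergeom_enrichment_p` and the toy counts are hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(N, B, n, b):
    """Return P(X >= b) for X ~ Hypergeometric(N, B, n).

    N: genes in the background list (assumed to include the target genes)
    B: background genes annotated with the GO term
    n: genes in the target list
    b: target genes annotated with the GO term
    """
    if not (0 <= B <= N and 0 <= n <= N and 0 <= b <= min(B, n)):
        raise ValueError("inconsistent counts")
    total = comb(N, n)  # number of ways to draw the target list
    upper = min(B, n)   # largest possible overlap
    # Sum the probability mass for every overlap at least as large as b.
    tail = sum(comb(B, k) * comb(N - B, n - k) for k in range(b, upper + 1))
    return tail / total


if __name__ == "__main__":
    # Hypothetical counts: 20,000 background genes, 200 carry the term,
    # 100 genes in the target list, 10 of which carry the term.
    print(hypergeom_enrichment_p(N=20000, B=200, n=100, b=10))
```

A small p-value suggests the term appears in the target list more often than chance would predict. In practice the result would also need correction for the number of GO terms tested.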
Web application that generates sequence logos, which are graphical representations of patterns within a multiple sequence alignment. Designed to make generating sequence logos easy.
Visualize and analyze data generated by all of Illumina's platforms.
Cell line GM12878 is a transformed cell line with species of origin Homo sapiens (human).
Cell line Hep-G2 is a cancer cell line with species of origin Homo sapiens (human).
Cell line K-562 is a cancer cell line with species of origin Homo sapiens (human).
Cell line HeLa is a cancer cell line with species of origin Homo sapiens (human).