Integrative analysis of the Caenorhabditis elegans genome by the modENCODE project.
We systematically generated large-scale data sets to improve genome annotation for the nematode Caenorhabditis elegans, a key model organism. These data sets include transcriptome profiling across a developmental time course, genome-wide identification of transcription factor-binding sites, and maps of chromatin organization. From this, we created more complete and accurate gene models, including alternative splice forms and candidate noncoding RNAs. We constructed hierarchical networks of transcription factor-binding and microRNA interactions and discovered chromosomal locations bound by an unusually large number of transcription factors. Different patterns of chromatin composition and histone modification were revealed between chromosome arms and centers, with similarly prominent differences between autosomes and the X chromosome. Integrating data types, we built statistical models relating chromatin, transcription factor binding, and gene expression. Overall, our analyses ascribed putative functions to most of the conserved genome.
Pubmed ID: 21177976 RIS Download
Animals | Caenorhabditis elegans | Caenorhabditis elegans Proteins | Chromatin | Chromosomes | Computational Biology | Conserved Sequence | Evolution, Molecular | Gene Expression Profiling | Gene Expression Regulation | Gene Regulatory Networks | Genes, Helminth | Genome, Helminth | Genomics | Histones | Models, Genetic | Molecular Sequence Annotation | RNA, Helminth | RNA, Untranslated | Regulatory Sequences, Nucleic Acid | Transcription Factors