Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Mouse endogenous retroviruses can trigger premature transcriptional termination at a distance.

Genome research | 2012

Endogenous retrotransposons have caused extensive genomic variation within mammalian species, but the functional implications of such mobilization are mostly unknown. We mapped thousands of endogenous retrovirus (ERV) germline integrants in highly divergent, previously unsequenced mouse lineages, facilitating a comparison of gene expression in the presence or absence of local insertions. Polymorphic ERVs occur relatively infrequently in gene introns and are particularly depleted from genes involved in embryogenesis or that are highly expressed in embryonic stem cells. Their genomic distribution implies ongoing negative selection due to deleterious effects on gene expression and function. A polymorphic, intronic ERV at Slc15a2 triggers up to 49-fold increases in premature transcriptional termination and up to 39-fold reductions in full-length transcripts in adult mouse tissues, thereby disrupting protein expression and functional activity. Prematurely truncated transcripts also occur at Polr1a, Spon1, and up to ∼5% of other genes when intronic ERV polymorphisms are present. Analysis of expression quantitative trait loci (eQTLs) in recombinant BxD mouse strains demonstrated very strong genetic associations between the polymorphic ERV in cis and disrupted transcript levels. Premature polyadenylation is triggered at genomic distances up to >12.5 kb upstream of the ERV, both in cis and between alleles. The parent of origin of the ERV is associated with variable expression of nonterminated transcripts and differential DNA methylation at its 5'-long terminal repeat. This study defines an unexpectedly strong functional impact of ERVs in disrupting gene transcription at a distance and demonstrates that ongoing retrotransposition can contribute significantly to natural phenotypic diversity.

Pubmed ID: 22367191 RIS Download

Research resources used in this publication

None found

Antibodies used in this publication

None found

Associated grants

  • Agency: CCR NIH HHS, United States
    Id: HHSN261200800001C
  • Agency: NCI NIH HHS, United States
    Id: HHSN261200800001E
  • Agency: NIGMS NIH HHS, United States
    Id: R01 GM035498
  • Agency: NIGMS NIH HHS, United States
    Id: R01-GM035498

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


GeneNetwork (tool)

RRID:SCR_002388

Web platform that provides access to data and tools to study complex networks of genes, molecules, and higher order gene function and phenotypes. Sequence data (SNPs) and transcriptome data sets (expression genetic or eQTL data sets). Quantitative trait locus (QTL) mapping module that is built into GN is optimized for fast on-line analysis of traits that are controlled by combinations of gene variants and environmental factors. Used to study humans, mice (BXD, AXB, LXS, etc.), rats (HXB), Drosophila, and plant species (barley and Arabidopsis). Users are welcome to enter their own private data.

View all literature mentions

BioPerl (tool)

RRID:SCR_002989

BioPerl is a community effort to produce Perl code which is useful in biology. This toolkit of perl modules is useful in building bioinformatics solutions in Perl. It is built in an object-oriented manner so that many modules depend on each other to achieve a task. The collection of modules in the bioperl-live repository consist of the core of the functionality of bioperl. Additionally auxiliary modules for creating graphical interfaces (bioperl-gui), persistent storage in RDMBS (bioperl-db), running and parsing the results from hundreds of bioinformatics applications (Run package), software to automate bioinformatic analyses (bioperl-pipeline) are all available as Git modules in our repository. The BioPerl toolkit provides a library of hundreds of routines for processing sequence, annotation, alignment, and sequence analysis reports. It often serves as a bridge between different computational biology applications assisting the user to construct analysis pipelines. This chapter illustrates how BioPerl facilitates tasks such as writing scripts summarizing information from BLAST reports or extracting key annotation details from a GenBank sequence record. BioPerl includes modules written by Sohel Merchant of the GO Consortium for parsing and manipulating OBO ontologies. Platform: Windows compatible, Mac OS X compatible, Linux compatible, Unix compatible

View all literature mentions

Mouse Genome Informatics (MGI) (tool)

RRID:SCR_006460

International database for laboratory mouse. Data offered by The Jackson Laboratory includes information on integrated genetic, genomic, and biological data. MGI creates and maintains integrated representation of mouse genetic, genomic, expression, and phenotype data and develops reference data set and consensus data views, synthesizes comparative genomic data between mouse and other mammals, maintains set of links and collaborations with other bioinformatics resources, develops and supports analysis and data submission tools, and provides technical support for database users. Projects contributing to this resource are: Mouse Genome Database (MGD) Project, Gene Expression Database (GXD) Project, Mouse Tumor Biology (MTB) Database Project, Gene Ontology (GO) Project at MGI, and MouseCyc Project at MGI.

View all literature mentions

GMAP (tool)

RRID:SCR_008992

THIS RESOURCE IS NO LONGER IN SERVICE, documented August 29, 2016. A software program for mapping and aligning cDNA sequences to a genome. The program maps and aligns a single sequence with minimal startup time and memory requirements, and provides fast batch processing of large sequence sets. The program generates accurate gene structures, even in the presence of substantial polymorphisms and sequence errors, without using probabilistic splice site models. Methodology underlying the program includes a minimal sampling strategy for genomic mapping, oligomer chaining for approximate alignment, sandwich DP for splice site detection, and microexon identification with statistical significance testing.

View all literature mentions

BLAT (tool)

RRID:SCR_011919

Software designed to quickly find sequences of 95% and greater similarity of length 25 bases or more.

View all literature mentions

DBA/2J (tool)

RRID:IMSR_JAX:000671

Mus musculus with name DBA/2J from IMSR.

View all literature mentions