Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Rare genetic variants underlie outlying levels of DNA methylation and gene-expression.

Human molecular genetics | 2023

Testing the effect of rare variants on phenotypic variation is difficult due to the need for extremely large cohorts to identify associated variants given expected effect sizes. An alternative approach is to investigate the effect of rare genetic variants on DNA methylation (DNAm) as effect sizes are expected to be larger for molecular traits compared with complex traits. Here, we investigate DNAm in healthy ageing populations-the Lothian Birth Cohorts of 1921 and 1936-and identify both transient and stable outlying DNAm levels across the genome. We find an enrichment of rare genetic single nucleotide polymorphisms (SNPs) within 1 kb of DNAm sites in individuals with stable outlying DNAm, implying genetic control of this extreme variation. Using a family-based cohort, the Brisbane Systems Genetics Study, we observed increased sharing of DNAm outliers among more closely related individuals, consistent with these outliers being driven by rare genetic variation. We demonstrated that outlying DNAm levels have a functional consequence on gene expression levels, with extreme levels of DNAm being associated with gene expression levels toward the tails of the population distribution. This study demonstrates the role of rare SNPs in the phenotypic variation of DNAm and the effect of extreme levels of DNAm on gene expression.

Pubmed ID: 36790133 RIS Download

Research resources used in this publication

None found

Antibodies used in this publication

None found

Associated grants

  • Agency: Wellcome Trust, United Kingdom
  • Agency: Medical Research Council, United Kingdom
    Id: MR/K026992/1
  • Agency: Biotechnology and Biological Sciences Research Council, United Kingdom
  • Agency: Chief Scientist Office, United Kingdom

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


GATK (tool)

RRID:SCR_001876

A software package to analyze next-generation resequencing data. The toolkit offers a wide variety of tools, with a primary focus on variant discovery and genotyping as well as strong emphasis on data quality assurance. Its robust architecture, powerful processing engine and high-performance computing features make it capable of taking on projects of any size. This software library makes writing efficient analysis tools using next-generation sequencing data very easy, and second it's a suite of tools for working with human medical resequencing projects such as 1000 Genomes and The Cancer Genome Atlas. These tools include things like a depth of coverage analyzers, a quality score recalibrator, a SNP/indel caller and a local realigner. (entry from Genetic Analysis Software)

View all literature mentions

Gene Expression Omnibus (GEO) (tool)

RRID:SCR_005012

Functional genomics data repository supporting MIAME-compliant data submissions. Includes microarray-based experiments measuring the abundance of mRNA, genomic DNA, and protein molecules, as well as non-array-based technologies such as serial analysis of gene expression (SAGE) and mass spectrometry proteomic technology. Array- and sequence-based data are accepted. Collection of curated gene expression DataSets, as well as original Series and Platform records. The database can be searched using keywords, organism, DataSet type and authors. DataSet records contain additional resources including cluster tools and differential expression queries.

View all literature mentions

PEER (tool)

RRID:SCR_009326

Software collection of Bayesian approaches to infer hidden determinants and their effects from gene expression profiles using factor analysis methods. Applications of PEER have * detected batch effects and experimental confounders * increased the number of expression QTL findings by threefold * allowed inference of intermediate cellular traits, such as transcription factor or pathway activations This project offers an efficient and versatile C++ implementation of the underlying algorithms with user-friendly interfaces to R and python.

View all literature mentions

CRAM (tool)

RRID:SCR_012975

A framework technology comprising file format and toolkit in which we combine highly efficient and tunable reference-based compression of sequence data with a data format that is directly available for computational use.

View all literature mentions