Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Genetic interaction analysis among oncogenesis-related genes revealed novel genes and networks in lung cancer development.

Yafang Li | Xiangjun Xiao | Yohan Bossé | Olga Gorlova | Ivan Gorlov | Younghun Han | Jinyoung Byun | Natasha Leighl | Jakob S Johansen | Matt Barnett | Chu Chen | Gary Goodman | Angela Cox | Fiona Taylor | Penella Woll | H Erich Wichmann | Judith Manz | Thomas Muley | Angela Risch | Albert Rosenberger | Jiali Han | Katherine Siminovitch | Susanne M Arnold | Eric B Haura | Ciprian Bolca | Ivana Holcatova | Vladimir Janout | Milica Kontic | Jolanta Lissowska | Anush Mukeria | Simona Ognjanovic | Tadeusz M Orlowski | Ghislaine Scelo | Beata Swiatkowska | David Zaridze | Per Bakke | Vidar Skaug | Shanbeh Zienolddiny | Eric J Duell | Lesley M Butler | Richard Houlston | María Soler Artigas | Kjell Grankvist | Mikael Johansson | Frances A Shepherd | Michael W Marcus | Hans Brunnström | Jonas Manjer | Olle Melander | David C Muller | Kim Overvad | Antonia Trichopoulou | Rosario Tumino | Geoffrey Liu | Stig E Bojesen | Xifeng Wu | Loic Le Marchand | Demetrios Albanes | Heike Bickeböller | Melinda C Aldrich | William S Bush | Adonina Tardon | Gad Rennert | M Dawn Teare | John K Field | Lambertus A Kiemeney | Philip Lazarus | Aage Haugen | Stephen Lam | Matthew B Schabath | Angeline S Andrew | Pier Alberto Bertazzi | Angela C Pesatori | David C Christiani | Neil Caporaso | Mattias Johansson | James D McKay | Paul Brennan | Rayjean J Hung | Christopher I Amos
Oncotarget | 2019

The development of cancer is driven by the accumulation of many oncogenesis-related genetic alterations and tumorigenesis is triggered by complex networks of involved genes rather than independent actions. To explore the epistasis existing among oncogenesis-related genes in lung cancer development, we conducted pairwise genetic interaction analyses among 35,031 SNPs from 2027 oncogenesis-related genes. The genotypes from three independent genome-wide association studies including a total of 24,037 lung cancer patients and 20,401 healthy controls with Caucasian ancestry were analyzed in the study. Using a two-stage study design including discovery and replication studies, and stringent Bonferroni correction for multiple statistical analysis, we identified significant genetic interactions between SNPs in RGL1:RAD51B (OR=0.44, p value=3.27x10-11 in overall lung cancer and OR=0.41, p value=9.71x10-11 in non-small cell lung cancer), SYNE1:RNF43 (OR=0.73, p value=1.01x10-12 in adenocarcinoma) and FHIT:TSPAN8 (OR=1.82, p value=7.62x10-11 in squamous cell carcinoma) in our analysis. None of these genes have been identified from previous main effect association studies in lung cancer. Further eQTL gene expression analysis in lung tissues provided information supporting the functional role of the identified epistasis in lung tumorigenesis. Gene set enrichment analysis revealed potential pathways and gene networks underlying molecular mechanisms in overall lung cancer as well as histology subtypes development. Our results provide evidence that genetic interactions between oncogenesis-related genes play an important role in lung tumorigenesis and epistasis analysis, combined with functional annotation, provides a valuable tool for uncovering functional novel susceptibility genes that contribute to lung cancer development by interacting with other modifier genes.

Pubmed ID: 30956756 RIS Download

Associated grants

  • Agency: NCI NIH HHS, United States
    Id: R35 CA197449
  • Agency: NCI NIH HHS, United States
    Id: P30 CA076292
  • Agency: World Health Organization, International
    Id: 001
  • Agency: NCI NIH HHS, United States
    Id: U01 CA167462
  • Agency: NCI NIH HHS, United States
    Id: U19 CA203654
  • Agency: NCI NIH HHS, United States
    Id: P50 CA119997
  • Agency: NCI NIH HHS, United States
    Id: R21 CA235464

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


PLINK (tool)

RRID:SCR_001757

Open source whole genome association analysis toolset, designed to perform range of basic, large scale analyses in computationally efficient manner. Used for analysis of genotype/phenotype data. Through integration with gPLINK and Haploview, there is some support for subsequent visualization, annotation and storage of results. PLINK 1.9 is improved and second generation of the software.

View all literature mentions

Gene Set Enrichment Analysis (tool)

RRID:SCR_003199

Software package for interpreting gene expression data. Used for interpretation of a large-scale experiment by identifying pathways and processes.

View all literature mentions

Ingenuity Pathways Knowledge Base (tool)

RRID:SCR_008117

A horizontally and vertically structured database that pulls scientific and medical information and describes it consistently using the Ingenuity Ontology. The Knowledge Base pulls information from journals, public molecular content databases, and textbooks. Data is curated and and integrated into the Knowledge Base .

View all literature mentions

QIAGEN (tool)

RRID:SCR_008539

A commercial organization which provides assay technologies to isolate DNA, RNA, and proteins from any biological sample. Assay technologies are then used to make specific target biomolecules, such as the DNA of a specific virus, visible for subsequent analysis.

View all literature mentions

Ingenuity Pathway Analysis (tool)

RRID:SCR_008653

A web-based software application that enables users to analyze, integrate, and understand data derived from gene expression, microRNA, and SNP microarrays, metabolomics, proteomics, and RNA-Seq experiments, and small-scale experiments that generate gene and chemical lists. Users can search for targeted information on genes, proteins, chemicals, and drugs, and build interactive models of experimental systems. IPA allows exploration of molecular, chemical, gene, protein and miRNA interactions, creation of custom molecular pathways, and the ability to view and modify metabolic, signaling, and toxicological canonical pathways. In addition to the networks and pathways that can be created, IPA can provide multiple layering of additional information, such as drugs, disease genes, expression data, cellular functions and processes, or a researchers own genes or chemicals of interest.

View all literature mentions

1000 Genomes Project and AWS (tool)

RRID:SCR_008801

A dataset containing the full genomic sequence of 1,700 individuals, freely available for research use. The 1000 Genomes Project is an international research effort coordinated by a consortium of 75 companies and organizations to establish the most detailed catalogue of human genetic variation. The project has grown to 200 terabytes of genomic data including DNA sequenced from more than 1,700 individuals that researchers can now access on AWS for use in disease research free of charge. The dataset containing the full genomic sequence of 1,700 individuals is now available to all via Amazon S3. The data can be found at: http://s3.amazonaws.com/1000genomes The 1000 Genomes Project aims to include the genomes of more than 2,662 individuals from 26 populations around the world, and the NIH will continue to add the remaining genome samples to the data collection this year. Public Data Sets on AWS provide a centralized repository of public data hosted on Amazon Simple Storage Service (Amazon S3). The data can be seamlessly accessed from AWS services such Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Elastic MapReduce (Amazon EMR), which provide organizations with the highly scalable compute resources needed to take advantage of these large data collections. AWS is storing the public data sets at no charge to the community. Researchers pay only for the additional AWS resources they need for further processing or analysis of the data. All 200 TB of the latest 1000 Genomes Project data is available in a publicly available Amazon S3 bucket. You can access the data via simple HTTP requests, or take advantage of the AWS SDKs in languages such as Ruby, Java, Python, .NET and PHP. Researchers can use the Amazon EC2 utility computing service to dive into this data without the usual capital investment required to work with data at this scale. AWS also provides a number of orchestration and automation services to help teams make their research available to others to remix and reuse. Making the data available via a bucket in Amazon S3 also means that customers can crunch the information using Hadoop via Amazon Elastic MapReduce, and take advantage of the growing collection of tools for running bioinformatics job flows, such as CloudBurst and Crossbow.

View all literature mentions

IMPUTE2 (tool)

RRID:SCR_013055

A computer program for phasing observed genotypes and imputing missing genotypes.

View all literature mentions