This service exclusively searches for literature that cites resources. Please be aware that the total number of searchable documents is limited to those containing RRIDs and does not include all open-access literature.
With millions of single-nucleotide polymorphisms (SNPs) identified and characterized, genomewide association studies have begun to identify susceptibility genes for complex traits and diseases. These studies involve the characterization and analysis of very-high-resolution SNP genotype data for hundreds or thousands of individuals. We describe a computationally efficient approach to testing association between SNPs and quantitative phenotypes, which can be applied to whole-genome association scans. In addition to observed genotypes, our approach allows estimation of missing genotypes, resulting in substantial increases in power when genotyping resources are limited. We estimate missing genotypes probabilistically using the Lander-Green or Elston-Stewart algorithms and combine high-resolution SNP genotypes for a subset of individuals in each pedigree with sparser marker data for the remaining individuals. We show that power is increased whenever phenotype information for ungenotyped individuals is included in analyses and that high-density genotyping of just three carefully selected individuals in a nuclear family can recover >90% of the information available if every individual were genotyped, for a fraction of the cost and experimental effort. To aid in study design, we evaluate the power of strategies that genotype different subsets of individuals in each pedigree and make recommendations about which individuals should be genotyped at a high density. To illustrate our method, we performed genomewide association analysis for 27 gene-expression phenotypes in 3-generation families (Centre d'Etude du Polymorphisme Humain pedigrees), in which genotypes for ~860,000 SNPs in 90 grandparents and parents are complemented by genotypes for ~6,700 SNPs in a total of 168 individuals. In addition to increasing the evidence of association at 15 previously identified cis-acting associated alleles, our genotype-inference algorithm allowed us to identify associated alleles at 4 cis-acting loci that were missed when analysis was restricted to individuals with the high-density SNP data. Our genotype-inference algorithm and the proposed association tests are implemented in software that is available for free.
Genome-wide association studies (GWASs) have identified abundant genetic susceptibility loci, GWAS of small sample size are far less from meeting the previous expectations due to low statistical power and false positive results. Effective statistical methods are required to further improve the analyses of massive GWAS data. Here we presented a new statistic (Robust Reference Powered Association Test) to use large public database (gnomad) as reference to reduce concern of potential population stratification. To evaluate the performance of this statistic for various situations, we simulated multiple sets of sample size and frequencies to compute statistical power. Furthermore, we applied our method to several real datasets (psoriasis genome-wide association datasets and schizophrenia genome-wide association dataset) to evaluate the performance. Careful analyses indicated that our newly developed statistic outperformed several previously developed GWAS applications. Importantly, this statistic is more robust than naive merging method in the presence of small control-reference differentiation, therefore likely to detect more association signals.
Modern improvement of complex traits in agricultural species relies on successful associations of heritable molecular variation with observable phenotypes. Historically, this pursuit has primarily been based on easily measurable genetic markers. The recent advent of new technologies allows assaying and quantifying biological intermediates (hereafter endophenotypes) which are now readily measurable at a large scale across diverse individuals. The usefulness of endophenotypes for delineating the regulatory landscape of the genome and genetic dissection of complex trait variation remains underexplored in plants. The work presented here illustrated the utility of a large-scale (299-genotype and seven-tissue) gene expression resource to dissect traits across multiple levels of biological organization. Using single-tissue- and multi-tissue-based transcriptome-wide association studies (TWAS), we revealed that about half of the functional variation acts through altered transcript abundance for maize kernel traits, including 30 grain carotenoid abundance traits, 20 grain tocochromanol abundance traits, and 22 field-measured agronomic traits. Comparing the efficacy of TWAS with genome-wide association studies (GWAS) and an ensemble approach that combines both GWAS and TWAS, we demonstrated that results of TWAS in combination with GWAS increase the power to detect known genes and aid in prioritizing likely causal genes. Using a variance partitioning approach in the largely independent maize Nested Association Mapping (NAM) population, we also showed that the most strongly associated genes identified by combining GWAS and TWAS explain more heritable variance for a majority of traits than the heritability captured by the random genes and the genes identified by GWAS or TWAS alone. This not only improves the ability to link genes to phenotypes, but also highlights the phenotypic consequences of regulatory variation in plants.
The vast majority of human populations and individuals have mixed ancestry. Consequently, adjustment for locus-specific ancestry is essential for genetic association studies. To empower association studies for all populations, it is necessary to integrate effects of locus-specific ancestry and genotype. We developed a joint test of ancestry and association that can be performed with summary statistics, that is independent of study design, can take advantage of locus-specific ancestry effects to boost power in association testing, and can utilize association effects to fine map admixture peaks. We illustrate the test using the association between serum triglycerides and LPL. By combining data from African Americans, European Americans, and West Africans, we identify three conditionally independent variants with varying amounts of ancestrally differentiated allele frequencies. Using out-of-sample data, we demonstrate improved prediction achievable by accounting for multiple causal variants and locus-specific ancestry effects at a single locus.
Iron (Fe) deficiency in plants is the result of low Fe soil availability affecting 30% of cultivated soils worldwide. To improve our understanding on Fe-efficiency this study aimed to (i) evaluate the influence of two different Fe regimes on morphological and physiological trait formation, (ii) identify polymorphisms statistically associated with morphological and physiological traits, and (iii) dissect the correlation between morphological and physiological traits using an association mapping population.
Graves' disease is an autoimmune thyroid disease of complex inheritance. Multiple genetic susceptibility loci are thought to be involved in Graves' disease and it is therefore likely that these can be identified by genome wide association studies. This study aimed to determine if a genome wide association study, using a pooling methodology, could detect genomic loci associated with Graves' disease.
We carried out an analysis of the Genetic Analysis Workshop 15 simulated Problem 3 data. We restricted ourselves to the present/absent phenotype. Linkage analysis revealed a very strong signal on chromosome 6. Association analysis revealed additional susceptible loci located on chromosomes 11 and 18. The latter two signals were subsequently verified with linkage analysis - but only after 20 replicates were pooled. Analysis of linkage disequilibrium patterns, in concert with family-based association tests, led us to infer the presence of a second chromosome 6 locus located in the vicinity of single-nucleotide polymorphisms 160-162. These analyses were carried out without knowledge of the model used to generate the simulation.
Sequencing studies are increasingly being conducted to identify rare variants associated with complex traits. The limited power of classical single-marker association analysis for rare variants poses a central challenge in such studies. We propose the sequence kernel association test (SKAT), a supervised, flexible, computationally efficient regression method to test for association between genetic variants (common and rare) in a region and a continuous or dichotomous trait while easily adjusting for covariates. As a score-based variance-component test, SKAT can quickly calculate p values analytically by fitting the null model containing only the covariates, and so can easily be applied to genome-wide data. Using SKAT to analyze a genome-wide sequencing study of 1000 individuals, by segmenting the whole genome into 30 kb regions, requires only 7 hr on a laptop. Through analysis of simulated data across a wide range of practical scenarios and triglyceride data from the Dallas Heart Study, we show that SKAT can substantially outperform several alternative rare-variant association tests. We also provide analytic power and sample-size calculations to help design candidate-gene, whole-exome, and whole-genome sequence association studies.
Background: The epidermal growth factor receptor 1 (EGFR1) plays a significant role in cell proliferation and development. Its regulation in humans is very critical and incompletely understood in Non small cell lung cancer (NSCLC). Methods: 100 newly diagnosed NSCLC (lung adenocarcinoma) patients and 100 healthy controls were included and allele specific (AS) polymerase chain reaction (PCR) was used to genotype and expression was analyzed by quantitative real time PCR. Overall survival of patients was analyzed by Kaplan-Meier method and for prognostic significance ROC curve was plotted. Results: A statistically significant difference (p<0.0001) in CC, AA and CA genotypes distribution among patients and healthy controls was observed. Compared to the CC genotype as reference, OR was 30.40 (95%CI 1.75- 524.9, p=0.0002) and 3.97 (95%CI 1.49-10.52, p=0.003) for the homozygous AA and heterozygous CA genotypes respectively. Kaplan-Meier survival analysis was also performed to analyze the relationship of EGFR1 (-191C/A) genotypes with progression free median survival of NSCLC patients and the difference was found to be significantly (p=0.0002) associated with different genotypes. In the ROC curve with respect to TNM stage at optimal cut-off value of 9.88 fold increase in EGFR1 mRNA expression, sensitivity and specificity were 92.9%, 83.3% respectively (AUC=0.95, p<0.0001). ROC curve w.r.t. distant metastases at optimal cut-off value of 13.5 fold change EGFR1 mRNA expression, sensitivity and specificity were 68.2%, 71.4% respectively (AUC=0.81, p<0.0001). In ROC curve w.r.t to presence/ absence of pleural effusion at optimal cut-off value of 14.8 fold change EGFR1 mRNA expression sensitivity and specificity were 66.7%, 68.2% respectively (AUC=0.71, p=0.009). Conclusions: Study concluded EGFR1 promoter polymorphism could be a risk factor associated with disease and may be used as prognostic marker for patients’ survival and predictor for disease worseness.
Although molecular pathway information and the International HapMap Project data can help biomedical researchers to investigate the aetiology of complex diseases more effectively, such information is missing or insufficient in current genetic association databases. In addition, only a few of the environmental risk factors are included as gene-environment interactions, and the risk measures of associations are not indexed in any association databases.
Dystonia is a common movement disorder. A number of monogenic causes have been identified. However, the majority of dystonia cases are not explained by single gene defects. Cervical dystonia is one of the commonest forms without genetic causes identified. This pilot study aimed to identify large effect-size risk loci in cervical dystonia. A genomewide association study (GWAS) was performed. British resident cervical dystonia patients of European descent were genotyped using the Illumina-610-Quad. Comparison was made with controls of European descent from the Wellcome Trust Case Control Consortium using logistic regression algorithm from PLINK. SNPs not genotyped by the array were imputed with 1000 Genomes Project data using the MaCH algorithm and minimac. Postimputation analysis was done with the mach2dat algorithm using a logistic regression model. After quality control measures, 212 cases were compared with 5173 controls. No single SNP passed the genomewide significant level of 5 × 10(-8) in the analysis of genotyped SNP in PLINK. Postimputation, there were 5 clusters of SNPs that had P value <5 × 10(-6) , and the best cluster of SNPs was found near exon 1 of NALCN, (sodium leak channel) with P = 9.76 × 10(-7) . Several potential regions were found in the GWAS and imputation analysis. The lowest P value was found in NALCN. Dysfunction of this ion channel is a plausible cause for dystonia. Further replication in another cohort is needed to confirm this finding. We make this data publicly available to encourage further analyses of this disorder.
Integrative analysis of genome-wide association studies (GWASs) and gene expression studies in the form of a transcriptome-wide association study (TWAS) has the potential to better elucidate the molecular mechanisms underlying disease etiology. Here we present a method, METRO, that can leverage gene expression data collected from multiple genetic ancestries to enhance TWASs. METRO incorporates expression prediction models constructed in different genetic ancestries through a likelihood-based inference framework, producing calibrated p values with substantially improved TWAS power. We illustrate the benefits of METRO in both simulations and applications to seven complex traits and diseases obtained from four GWASs. These GWASs include two of primarily European ancestry (n = 188,577 and 339,226) and two of primarily African ancestry (n = 42,752 and 23,827). In the real data applications, we leverage gene expression data measured on 1,032 African Americans and 801 European Americans from the Genetic Epidemiology Network of Arteriopathy (GENOA) study to identify a substantially larger number of gene-trait associations as compared to existing TWAS approaches. The benefits of METRO are most prominent in applications to GWASs of African ancestry where the sample size is much smaller than GWASs of European ancestry and where a more powerful TWAS method is crucial. Among the identified associations are high-density lipoprotein-associated genes including PLTP and PPARG that are critical for maintaining lipid homeostasis and the type II diabetes-associated gene MAPT that supports microtubule-associated protein tau as a key component underlying impaired insulin secretion.
Neonatal adiposity is a risk factor for childhood obesity. Investigating contributors to neonatal adiposity is important for understanding early life obesity risk. Epigenetic changes of metabolic genes in cord blood may contribute to excessive neonatal adiposity and subsequent childhood obesity. This study aims to evaluate the association of cord blood DNA methylation patterns with anthropometric measures and cord blood leptin, a biomarker of neonatal adiposity.
Rocuronium, a common neuromuscular blocking agent, is mainly excreted unchanged in urine (10-25%) and bile (>70%). Age, sex, liver blood flow, smoking, medical conditions, and ethnic background can affect its pharmacological actions. However, reasons for the wide variation in rocuronium requirements are mostly unknown. We hypothesised that pharmacogenetic factors might explain part of the variation.
The striatum receives projections from multiple regions of the cerebral cortex consistent with its role in diverse motor, affective, and cognitive functions. Supporting cognitive functions, the caudate receives projections from cortical association regions. Building on recent insights about the details of how multiple cortical networks are specialized for distinct aspects of higher-order cognition, we revisited caudate organization using within-individual precision neuroimaging (n=2, each participant scanned 31 times). Detailed analysis revealed that the caudate has side-by-side zones that are coupled to at least Give distinct distributed association networks, paralleling the specialization observed in the cerebral cortex. Examining correlation maps from closely juxtaposed seed regions in the caudate recapitulated the Give distinct cerebral networks including their multiple spatially distributed regions. These results extend the general notion of parallel specialized basal ganglia circuits, with the additional discovery that even within the caudate, there is Gine-grained separation of multiple distinct higher-order networks.
Coffee consumption is a model for addictive behavior. We performed a meta-analysis of genome-wide association studies (GWASs) on coffee intake from 8 Caucasian cohorts (N=18 176) and sought replication of our top findings in a further 7929 individuals. We also performed a gene expression analysis treating different cell lines with caffeine. Genome-wide significant association was observed for two single-nucleotide polymorphisms (SNPs) in the 15q24 region. The two SNPs rs2470893 and rs2472297 (P-values=1.6 × 10(-11) and 2.7 × 10(-11)), which were also in strong linkage disequilibrium (r(2)=0.7) with each other, lie in the 23-kb long commonly shared 5' flanking region between CYP1A1 and CYP1A2 genes. CYP1A1 was found to be downregulated in lymphoblastoid cell lines treated with caffeine. CYP1A1 is known to metabolize polycyclic aromatic hydrocarbons, which are important constituents of coffee, whereas CYP1A2 is involved in the primary metabolism of caffeine. Significant evidence of association was also detected at rs382140 (P-value=3.9 × 10(-09)) near NRCAM-a gene implicated in vulnerability to addiction, and at another independent hit rs6495122 (P-value=7.1 × 10(-09))-an SNP associated with blood pressure-in the 15q24 region near the gene ULK3, in the meta-analysis of discovery and replication cohorts. Our results from GWASs and expression analysis also strongly implicate CAB39L in coffee drinking. Pathway analysis of differentially expressed genes revealed significantly enriched ubiquitin proteasome (P-value=2.2 × 10(-05)) and Parkinson's disease pathways (P-value=3.6 × 10(-05)).
Blood lipid traits are treatable and heritable risk factors for heart disease, a leading cause of mortality worldwide. Although genome-wide association studies (GWAS) have discovered hundreds of variants associated with lipids in humans, most of the causal mechanisms of lipids remain unknown. To better understand the biological processes underlying lipid metabolism, we investigated the associations of plasma protein levels with total cholesterol (TC), triglycerides (TG), high-density lipoprotein cholesterol (HDL), and low-density lipoprotein cholesterol (LDL) in blood. We trained protein prediction models based on samples in the Multi-Ethnic Study of Atherosclerosis (MESA) and applied them to conduct proteome-wide association studies (PWAS) for lipids using the Global Lipids Genetics Consortium (GLGC) data. Of the 749 proteins tested, 42 were significantly associated with at least one lipid trait. Furthermore, we performed transcriptome-wide association studies (TWAS) for lipids using 9,714 gene expression prediction models trained on samples from peripheral blood mononuclear cells (PBMCs) in MESA and 49 tissues in the Genotype-Tissue Expression (GTEx) project. We found that although PWAS and TWAS can show different directions of associations in an individual gene, 40 out of 49 tissues showed a positive correlation between PWAS and TWAS signed p-values across all the genes, which suggests a high-level consistency between proteome-lipid associations and transcriptome-lipid associations.
Welcome to the FDI Lab - SciCrunch.org Resources search. From here you can search through a compilation of resources used by FDI Lab - SciCrunch.org and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that FDI Lab - SciCrunch.org has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on FDI Lab - SciCrunch.org then you can log in from here to get additional features in FDI Lab - SciCrunch.org such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into FDI Lab - SciCrunch.org you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the facets that you can filter your papers by.
From here we'll present any options for the literature, such as exporting your current results.
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.
Year:
Count: