Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Large, Diverse Population Cohorts of hiPSCs and Derived Hepatocyte-like Cells Reveal Functional Genetic Variation at Blood Lipid-Associated Loci.

Cell stem cell | 2017

Genome-wide association studies have struggled to identify functional genes and variants underlying complex phenotypes. We recruited a multi-ethnic cohort of healthy volunteers (n = 91) and used their tissue to generate induced pluripotent stem cells (iPSCs) and hepatocyte-like cells (HLCs) for genome-wide mapping of expression quantitative trait loci (eQTLs) and allele-specific expression (ASE). We identified many eQTL genes (eGenes) not observed in the comparably sized Genotype-Tissue Expression project's human liver cohort (n = 96). Focusing on blood lipid-associated loci, we performed massively parallel reporter assays to screen candidate functional variants and used genome-edited stem cells, CRISPR interference, and mouse modeling to establish rs2277862-CPNE1, rs10889356-DOCK7, rs10889356-ANGPTL3, and rs10872142-FRK as functional SNP-gene sets. We demonstrated HLC eGenes CPNE1, VKORC1, UBE2L3, and ANGPTL3 and HLC ASE gene ACAA2 to be lipid-functional genes in mouse models. These findings endorse an iPSC-based experimental framework to discover functional variants and genes contributing to complex human traits.

Pubmed ID: 28388432 RIS Download

Associated grants

  • Agency: NIDDK NIH HHS, United States
    Id: P30 DK050306
  • Agency: Howard Hughes Medical Institute, United States
  • Agency: NHLBI NIH HHS, United States
    Id: RC2 HL101864
  • Agency: NCI NIH HHS, United States
    Id: P30 CA138313
  • Agency: NIDDK NIH HHS, United States
    Id: R01 DK099571
  • Agency: NCATS NIH HHS, United States
    Id: UL1 TR000003
  • Agency: NIGMS NIH HHS, United States
    Id: R01 GM104464
  • Agency: NHLBI NIH HHS, United States
    Id: R01 HL133218
  • Agency: NIMH NIH HHS, United States
    Id: R01 MH101822
  • Agency: NIDDK NIH HHS, United States
    Id: R01 DK102716
  • Agency: NHLBI NIH HHS, United States
    Id: R01 HL118744
  • Agency: NHGRI NIH HHS, United States
    Id: U01 HG006398

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


STAR (tool)

RRID:SCR_015899

Software performing alignment of high-throughput RNA-seq data. Aligns RNA-seq reads to reference genome using uncompressed suffix arrays.

View all literature mentions

STAR (tool)

RRID:SCR_004463

Software performing alignment of high-throughput RNA-seq data. Aligns RNA-seq reads to reference genome using uncompressed suffix arrays.

View all literature mentions

HNF4A-human (antibody)

RRID:AB_2117025

This polyclonal targets HNF4A

View all literature mentions

C57BL/6J (organism)

RRID:IMSR_JAX:000664

Mus musculus with name C57BL/6J from IMSR.

View all literature mentions

SAMTOOLS (software resource)

RRID:SCR_002105

Original SAMTOOLS package has been split into three separate repositories including Samtools, BCFtools and HTSlib. Samtools for manipulating next generation sequencing data used for reading, writing, editing, indexing,viewing nucleotide alignments in SAM,BAM,CRAM format. BCFtools used for reading, writing BCF2,VCF, gVCF files and calling, filtering, summarising SNP and short indel sequence variants. HTSlib used for reading, writing high throughput sequencing data.

View all literature mentions

GraphPad Prism (software resource)

RRID:SCR_002798

Statistical analysis software that combines scientific graphing, comprehensive curve fitting (nonlinear regression), understandable statistics, and data organization. Designed for biological research applications in pharmacology, physiology, and other biological fields for data analysis, hypothesis testing, and modeling.

View all literature mentions

ToppGene Suite (data analysis service)

RRID:SCR_005726

ToppGene Suite is a one-stop portal for gene list enrichment analysis and candidate gene prioritization based on functional annotations and protein interactions network. ToppGene Suite is a one-stop portal for (i) gene list functional enrichment, (ii) candidate gene prioritization using either functional annotations or network analysis and (iii) identification and prioritization of novel disease candidate genes in the interactome. Functional annotation-based disease candidate gene prioritization uses a fuzzy-based similarity measure to compute the similarity between any two genes based on semantic annotations. The similarity scores from individual features are combined into an overall score using statistical meta-analysis.

View all literature mentions

Qvalue (software resource)

RRID:SCR_001073

R package that takes a list of p-values resulting from the simultaneous testing of hypotheses and estimates their q-values. It is designed to measure the proportion of false positives when a test is significant. The software is capable of generating plots for visualization. It can be applied to problems in genomics, brain imaging, astrophysics, and data mining.

View all literature mentions

PEER (software resource)

RRID:SCR_009326

Software collection of Bayesian approaches to infer hidden determinants and their effects from gene expression profiles using factor analysis methods. Applications of PEER have * detected batch effects and experimental confounders * increased the number of expression QTL findings by threefold * allowed inference of intermediate cellular traits, such as transcription factor or pathway activations This project offers an efficient and versatile C++ implementation of the underlying algorithms with user-friendly interfaces to R and python.

View all literature mentions

EQTL EXPLORER (software resource)

RRID:SCR_001123

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 23,2022. An eQTL visualization tool that allows users to mine and understand data from a repository of genetical genomics experiments (entry from Genetic Analysis Software)

View all literature mentions

PheWAS R Package (software resource)

RRID:SCR_003512

Software package contains methods for performing Phenome-Wide Association Study.

View all literature mentions

RSEM (software resource)

RRID:SCR_013027

Software package for quantifying gene and isoform abundances from single end or paired end RNA Seq data. Accurate transcript quantification from RNA Seq data with or without reference genome. Used for accurate quantification of gene and isoform expression from RNA-Seq data.

View all literature mentions

Systems Transcriptional Activity Reconstruction (data or information resource)

RRID:SCR_005622

A next-generation web-based application that aims to provide an integrated solution for both visualization and analysis of deep-sequencing data, along with simple access to public datasets.

View all literature mentions

Trim Galore (data processing software)

RRID:SCR_011847

Software tool to automate quality and adapter trimming as well as quality control, with some added functionality to remove biased methylation positions for RRBS sequence files for directional, non-directional or paired-end sequencing. Wrapper around Cutadapt and FastQC to consistently apply adapter and quality trimming to FastQ files, with extra functionality for Reduced Representation Bisulfite Sequencing data.

View all literature mentions

FastQC (software resource)

RRID:SCR_014583

Quality control software that perform checks on raw sequence data coming from high throughput sequencing pipelines. This software also provides a modular set of analyses which can give a quick impression of the quality of the data prior to further analysis.

View all literature mentions

Eigensoft (software resource)

RRID:SCR_004965

EIGENSOFT package combines functionality from our population genetics methods (Patterson et al. 2006) and our EIGENSTRAT stratification method (Price et al. 2006). The EIGENSTRAT method uses principal components analysis to explicitly model ancestry differences between cases and controls along continuous axes of variation; the resulting correction is specific to a candidate marker''s variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. The EIGENSOFT package has a built-in plotting script and supports multiple file formats and quantitative phenotypes. Source code, documentation and executables for using EIGENSOFT 3.0 on a Linux platform can be downloaded. New features of EIGENSOFT 3.0 include supporting either 32-bit or 64-bit Linux machines, a utility to merge different data sets, a utility to identify related samples (accounting for population structure), and supporting multiple file formats for EIGENSTRAT stratification correction.

View all literature mentions

IMPUTE2 (software resource)

RRID:SCR_013055

A computer program for phasing observed genotypes and imputing missing genotypes.

View all literature mentions

FlowJo (software resource)

RRID:SCR_008520

Software for single-cell flow cytometry analysis. Its functions include management, display, manipulation, analysis and publication of the data stream produced by flow and mass cytometers.

View all literature mentions

WA07 (cell line)

RRID:CVCL_9772

Cell line WA07 is a Embryonic stem cell with a species of origin Homo sapiens

View all literature mentions

HUES 8 (cell line)

RRID:CVCL_B207

Cell line HUES 8 is a Embryonic stem cell with a species of origin Homo sapiens (Human)

View all literature mentions

NIH 3T3 (cell line)

RRID:CVCL_0594

Cell line NIH 3T3 is a Spontaneously immortalized cell line with a species of origin Mus musculus

View all literature mentions

Hep-G2 (cell line)

RRID:CVCL_0027

Cell line Hep-G2 is a Cancer cell line with a species of origin Homo sapiens (Human)

View all literature mentions

HEK293T (cell line)

RRID:CVCL_0063

Cell line HEK293T is a Transformed cell line with a species of origin Homo sapiens (Human)

View all literature mentions

C57BL/6J (organism)

RRID:IMSR_JAX:000664

Mus musculus with name C57BL/6J from IMSR.

View all literature mentions

C57BL/6J (organism)

RRID:IMSR_JAX:000664

Mus musculus with name C57BL/6J from IMSR.

View all literature mentions