Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Publication

Variant association tools for quality control and analysis of large-scale sequence and genotyping array data.

Gao T Wang | Bo Peng | Suzanne M Leal

American journal of human genetics | 2014

Currently there is great interest in detecting associations between complex traits and rare variants. In this report, we describe Variant Association Tools (VAT) and the VAT pipeline, which implements best practices for rare-variant association studies. Highlights of VAT include variant-site and call-level quality control (QC), summary statistics, phenotype- and genotype-based sample selection, variant annotation, selection of variants for association analysis, and a collection of rare-variant association methods for analyzing qualitative and quantitative traits. The association testing framework for VAT is regression based, which readily allows for flexible construction of association models with multiple covariates and weighting themes based on allele frequencies or predicted functionality. Additionally, pathway analyses, conditional analyses, and analyses of gene-gene and gene-environment interactions can be performed. VAT is capable of rapidly scanning through data by using multi-process computation, adaptive permutation, and simultaneously conducting association analysis via multiple methods. Results are available in text or graphic file formats and additionally can be output to relational databases for further annotation and filtering. An interface to R language also facilitates user implementation of novel association methods. The VAT's data QC and association-analysis pipeline can be applied to sequence, imputed, and genotyping array, e.g., "exome chip," data, providing a reliable and reproducible computational environment in which to analyze small- to large-scale studies with data from the latest genotyping and sequencing technologies. Application of the VAT pipeline is demonstrated through analysis of data from the 1000 Genomes project.

Pubmed ID: 24791902 RIS Download

Research resources used in this publication

None found

Additional research tools detected in this publication

Antibodies used in this publication

None found

Associated grants

Agency: NCI NIH HHS, United States
Id: P30 CA016672
Agency: NHLBI NIH HHS, United States
Id: RC2 HL102926
Agency: NIMHD NIH HHS, United States
Id: RC4 MD005964
Agency: NHGRI NIH HHS, United States
Id: R01 HG005859
Agency: NHGRI NIH HHS, United States
Id: 1R01HG005859
Agency: NHGRI NIH HHS, United States
Id: HG006493
Agency: NHLBI NIH HHS, United States
Id: HL102926
Agency: NHGRI NIH HHS, United States
Id: UM1 HG006493
Agency: NIMHD NIH HHS, United States
Id: MD005964
Agency: NHGRI NIH HHS, United States
Id: U54 HG006493
Agency: NHLBI NIH HHS, United States
Id: UC2 HL102926

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.

COSMIC - Catalogue Of Somatic Mutations In Cancer (tool)

RRID:SCR_002260

Database to store and display somatic mutation information and related details and contains information relating to human cancers. The mutation data and associated information is extracted from the primary literature. In order to provide a consistent view of the data a histology and tissue ontology has been created and all mutations are mapped to a single version of each gene. The data can be queried by tissue, histology or gene and displayed as a graph, as a table or exported in various formats.
Some key features of COSMIC are:
* Contains information on publications, samples and mutations. Includes samples which have been found to be negative for mutations during screening therefore enabling frequency data to be calculated for mutations in different genes in different cancer types.
* Samples entered include benign neoplasms and other benign proliferations, in situ and invasive tumours, recurrences, metastases and cancer cell lines.

View all literature mentions

OMIM (tool)

RRID:SCR_006437

Online catalog of human genes and genetic disorders, for clinical features, phenotypes and genes. Collection of human genes and genetic phenotypes, focusing on relationship between phenotype and genotype. Referenced overviews in OMIM contain information on all known mendelian disorders and variety of related genes. It is updated daily, and entries contain copious links to other genetics resources.

View all literature mentions

About

The SciCrunch Infrastructure was developed as a cooperative data platform to be used by diverse communities in making data more FAIR.

Contact Us

FAIR Data Informatics Lab

University of California, San Diego

9500 Gilman Drive, Mail Code 0608

La Jolla, CA 92093-0608

United States

info

scicrunch.org

About SciCrunch | Privacy Policy | Terms of Service