2024MAY10: Our hosting provider is experiencing intermittent networking issues. We apologize for any inconvenience.

Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Mapping asthma-associated variants in admixed populations.

Frontiers in genetics | 2015

Admixed populations arise when two or more previously isolated populations interbreed. Mapping asthma susceptibility loci in an admixed population using admixture mapping (AM) involves screening the genome of individuals of mixed ancestry for chromosomal regions that have a higher frequency of alleles from a parental population with higher asthma risk as compared with parental population with lower asthma risk. AM takes advantage of the admixture created in populations of mixed ancestry to identify genomic regions where an association exists between genetic ancestry and asthma (in contrast to between the genotype of the marker and asthma). The theory behind AM is that chromosomal segments of affected individuals contain a significantly higher-than-average proportion of alleles from the high-risk parental population and thus are more likely to harbor disease-associated loci. Criteria to evaluate the applicability of AM as a gene mapping approach include: (1) the prevalence of the disease differences in ancestral populations from which the admixed population was formed; (2) a measurable difference in disease-causing alleles between the parental populations; (3) reduced linkage disequilibrium (LD) between unlinked loci across chromosomes and strong LD between neighboring loci; (4) a set of markers with noticeable allele-frequency differences between parental populations that contributes to the admixed population (single nucleotide polymorphisms (SNPs) are the markers of choice because they are abundant, stable, relatively cheap to genotype, and informative with regard to the LD structure of chromosomal segments); and (5) there is an understanding of the extent of segmental chromosomal admixtures and their interactions with environmental factors. Although genome-wide association studies have contributed greatly to our understanding of the genetic components of asthma, the large and increasing degree of admixture in populations across the world create many challenges for further efforts to map disease-causing genes. This review, summarizes the historical context of admixed populations and AM, and considers current opportunities to use AM to map asthma genes. In addition, we provide an overview of the potential limitations and future directions of AM in biomedical research, including joint admixture and association mapping for asthma and asthma-related disorders.

Pubmed ID: 26483834 RIS Download

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


SABER (tool)

RRID:SCR_001257

Software program suitable for genome-scale data which uses a Markov-hidden Markov model (MHMM) to estimate local ancestry. The MHMM makes it possible to identify genomic blocks of a particular ancestry by use of any high-density single-nucleotide-polymorphism panel. One application is to perform admixture mapping without genotyping special ancestry-informative-marker panels.

View all literature mentions

Local Ancestry in adMixed Populations (tool)

RRID:SCR_001258

A software package for the inference of locus-specific ancestry in recently admixed populations. LAMP-LD takes the genotypes of admixed individuals as well as reference haplotype panels approximating the mixing ancestral populations, and outputs the estimated number of alleles from each ancestry in each locus for each individual. The LAMP-LD package also includes the program LAMP-HAP, which processes haplotype data when high-quality phasing is available, and utilizes trio nuclear family designs to improve estimation accuracy. LAMP-LD is based on a window-based processing combined within a hierarchical Hidden Markov Model. It can process 2,3 or 5 mixing populations, and its short per-sample processing time makes it suitable for analyzing large datasets of dense SNP panels. The original program LAMP does not use the LD and therefore is not as accurate, but it is useful in cases where the SNP density is not high enough or when the ancestral haplotypes are unkown.

View all literature mentions

GemTools (tool)

RRID:SCR_001259

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on September 23,2022. Software tools for modeling genetic ancestry based on the single nucleotide polymorphism (SNP) information. This package of functions helps the user account for genetic ancestry of a large number of individuals using spectral graph theory and projections to break a large problem into smaller pieces and calculate genetic ancestry information efficiently, i.e., a divide and conquer (dac) strategy. It is completely written in R and runs on any platform that supports R.

View all literature mentions

frappe (tool)

RRID:SCR_001264

Software using a f frequentist approach for estimating individual ancestry proportion.

View all literature mentions

CRAN (tool)

RRID:SCR_003005

Network of ftp and web servers around world that store identical, up to date, versions of code and documentation for R. Package archive network for R programming language.

View all literature mentions

NGSadmix (tool)

RRID:SCR_003208

A tool for finding admixture proportions from next generation sequencing (NGS) data that is based on genotype likelihoods. It is a multithreaded c/c++ program.

View all literature mentions

Eigensoft (tool)

RRID:SCR_004965

EIGENSOFT package combines functionality from our population genetics methods (Patterson et al. 2006) and our EIGENSTRAT stratification method (Price et al. 2006). The EIGENSTRAT method uses principal components analysis to explicitly model ancestry differences between cases and controls along continuous axes of variation; the resulting correction is specific to a candidate marker''s variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. The EIGENSOFT package has a built-in plotting script and supports multiple file formats and quantitative phenotypes. Source code, documentation and executables for using EIGENSOFT 3.0 on a Linux platform can be downloaded. New features of EIGENSOFT 3.0 include supporting either 32-bit or 64-bit Linux machines, a utility to merge different data sets, a utility to identify related samples (accounting for population structure), and supporting multiple file formats for EIGENSTRAT stratification correction.

View all literature mentions

1000 Genomes: A Deep Catalog of Human Genetic Variation (tool)

RRID:SCR_006828

International collaboration producing an extensive public catalog of human genetic variation, including SNPs and structural variants, and their haplotype contexts, in an effort to provide a foundation for investigating the relationship between genotype and phenotype. The genomes of about 2500 unidentified people from about 25 populations around the world were sequenced using next-generation sequencing technologies. Redundant sequencing on various platforms and by different groups of scientists of the same samples can be compared. The results of the study are freely and publicly accessible to researchers worldwide. The consortium identified the following populations whose DNA will be sequenced: Yoruba in Ibadan, Nigeria; Japanese in Tokyo; Chinese in Beijing; Utah residents with ancestry from northern and western Europe; Luhya in Webuye, Kenya; Maasai in Kinyawa, Kenya; Toscani in Italy; Gujarati Indians in Houston; Chinese in metropolitan Denver; people of Mexican ancestry in Los Angeles; and people of African ancestry in the southwestern United States. The goal Project is to find most genetic variants that have frequencies of at least 1% in the populations studied. Sequencing is still too expensive to deeply sequence the many samples being studied for this project. However, any particular region of the genome generally contains a limited number of haplotypes. Data can be combined across many samples to allow efficient detection of most of the variants in a region. The Project currently plans to sequence each sample to about 4X coverage; at this depth sequencing cannot provide the complete genotype of each sample, but should allow the detection of most variants with frequencies as low as 1%. Combining the data from 2500 samples should allow highly accurate estimation (imputation) of the variants and genotypes for each sample that were not seen directly by the light sequencing. All samples from the 1000 genomes are available as lymphoblastoid cell lines (LCLs) and LCL derived DNA from the Coriell Cell Repository as part of the NHGRI Catalog. The sequence and alignment data generated by the 1000genomes project is made available as quickly as possible via their mirrored ftp sites. ftp://ftp.1000genomes.ebi.ac.uk ftp://ftp-trace.ncbi.nlm.nih.gov/1000genomes

View all literature mentions

STRAT (tool)

RRID:SCR_009110

Software application that is a companion program to STRUCTURE that is a structured association method, for use in association mapping, enabling valid case-control studies even in the presence of population structure. (entry from Genetic Analysis Software)

View all literature mentions

TREESELECT (tool)

RRID:SCR_009422

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on May 6th,2023. Software package for inferring natural selection from unusual population differentiation between closely related populations. (entry from Genetic Analysis Software)

View all literature mentions

Illumina (tool)

RRID:SCR_010233

American company incorporated that develops, manufactures and markets integrated systems for the analysis of genetic variation and biological function. Provides a line of products and services that serve the sequencing, genotyping and gene expression and proteomics markets. Its headquarters are located in San Diego, California.

View all literature mentions

GWAS: Catalog of Published Genome-Wide Association Studies (tool)

RRID:SCR_012745

Catalog of published genome-wide association studies. Genome-wide set of genetic variants in different individuals to see if any variant is associated with trait and disease. Database of genome-wide association study (GWAS) publications including only those attempting to assay single nucleotide polymorphisms (SNPs). Publications are organized from most to least recent date of publication. Studies are identified through weekly PubMed literature searches, daily NIH-distributed compilations of news and media reports, and occasional comparisons with an existing database of GWAS literature (HuGE Navigator). Works with HANCESTRO ancestry representation.

View all literature mentions

ADMIXMAP (tool)

RRID:SCR_009035

General-purpose program for modelling admixture, using marker genotypes and trait data on a sample of individuals from an admixed population (such as African-Americans), where the markers have been chosen to have extreme differentials in allele frequencies between two or more of the ancestral populations between which admixture has occurred. The main difference between ADMIXMAP and classical programs for estimation of admixture such as ADMIX is that ADMIXMAP is based on a multilevel model for the distribution of individual admixture in the population and the stochastic variation of ancestry on hybrid chromosomes. This makes it possible to model the associations of ancestry between linked marker loci, and the association of a trait with individual admixture or with ancestry at a linked marker locus. (entry from Genetic Analysis Software)

View all literature mentions

ADMIXTURE (tool)

RRID:SCR_001263

A software tool for maximum likelihood estimation of individual ancestries from multilocus SNP genotype datasets. It uses the same statistical model as STRUCTURE but calculates estimates much more rapidly using a fast numerical optimization algorithm. It uses a block relaxation approach to alternately update allele frequency and ancestry fraction parameters. Each block update is handled by solving a large number of independent convex optimization problems, which are tackled using a fast sequential quadratic programming algorithm. Convergence of the algorithm is accelerated using a novel quasi-Newton acceleration method.

View all literature mentions

International HapMap Project (tool)

RRID:SCR_002846

THIS RESOURCE IS NO LONGER IN SERVICE, documented August 22, 2016. A multi-country collaboration among scientists and funding agencies to develop a public resource where genetic similarities and differences in human beings are identified and catalogued. Using this information, researchers will be able to find genes that affect health, disease, and individual responses to medications and environmental factors. All of the information generated by the Project will be released into the public domain. Their goal is to compare the genetic sequences of different individuals to identify chromosomal regions where genetic variants are shared. Public and private organizations in six countries are participating in the International HapMap Project. Data generated by the Project can be downloaded with minimal constraints. HapMap project related data, software, and documentation include: bulk data on genotypes, frequencies, LD data, phasing data, allocated SNPs, recombination rates and hotspots, SNP assays, Perlegen amplicons, raw data, inferred genotypes, and mitochondrial and chrY haplogroups; Generic Genome Browser software; protocols and information on assay design, genotyping and other protocols used in the project; and documentation of samples/individuals and the XML format used in the project.

View all literature mentions

Hapmix (tool)

RRID:SCR_004203

Software application that uses genotyping data from SNP arrays for accurately inferring chromosomal segments of distinct continental ancestry in admixed populations, using dense genetic data. (entry from Genetic Analysis Software)

View all literature mentions

University of North Carolina at Chapel Hill; North Carolina; USA (tool)

RRID:SCR_006100

The University of North Carolina at Chapel Hill, also known as UNC-Chapel Hill, Chapel Hill, North Carolina, or simply Carolina is a public research university in Chapel Hill, North Carolina. It is the flagship of the 17 campuses of the University of North Carolina system.

View all literature mentions

1000 Genomes Project and AWS (tool)

RRID:SCR_008801

A dataset containing the full genomic sequence of 1,700 individuals, freely available for research use. The 1000 Genomes Project is an international research effort coordinated by a consortium of 75 companies and organizations to establish the most detailed catalogue of human genetic variation. The project has grown to 200 terabytes of genomic data including DNA sequenced from more than 1,700 individuals that researchers can now access on AWS for use in disease research free of charge. The dataset containing the full genomic sequence of 1,700 individuals is now available to all via Amazon S3. The data can be found at: http://s3.amazonaws.com/1000genomes The 1000 Genomes Project aims to include the genomes of more than 2,662 individuals from 26 populations around the world, and the NIH will continue to add the remaining genome samples to the data collection this year. Public Data Sets on AWS provide a centralized repository of public data hosted on Amazon Simple Storage Service (Amazon S3). The data can be seamlessly accessed from AWS services such Amazon Elastic Compute Cloud (Amazon EC2) and Amazon Elastic MapReduce (Amazon EMR), which provide organizations with the highly scalable compute resources needed to take advantage of these large data collections. AWS is storing the public data sets at no charge to the community. Researchers pay only for the additional AWS resources they need for further processing or analysis of the data. All 200 TB of the latest 1000 Genomes Project data is available in a publicly available Amazon S3 bucket. You can access the data via simple HTTP requests, or take advantage of the AWS SDKs in languages such as Ruby, Java, Python, .NET and PHP. Researchers can use the Amazon EC2 utility computing service to dive into this data without the usual capital investment required to work with data at this scale. AWS also provides a number of orchestration and automation services to help teams make their research available to others to remix and reuse. Making the data available via a bucket in Amazon S3 also means that customers can crunch the information using Hadoop via Amazon Elastic MapReduce, and take advantage of the growing collection of tools for running bioinformatics job flows, such as CloudBurst and Crossbow.

View all literature mentions