Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

  • Register
X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X

Leaving Community

Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.

No
Yes

1000 Genomes: A Deep Catalog of Human Genetic Variation (RRID:SCR_006828)


http://www.1000genomes.org/

International collaboration producing an extensive public catalog of human genetic variation, including SNPs and structural variants, and their haplotype contexts, in an effort to provide a foundation for investigating the relationship between genotype and phenotype. The genomes of about 2500 unidentified people from about 25 populations around the world were sequenced using next-generation sequencing technologies. Redundant sequencing on various platforms and by different groups of scientists of the same samples can be compared. The results of the study are freely and publicly accessible to researchers worldwide. The consortium identified the following populations whose DNA will be sequenced: Yoruba in Ibadan, Nigeria; Japanese in Tokyo; Chinese in Beijing; Utah residents with ancestry from northern and western Europe; Luhya in Webuye, Kenya; Maasai in Kinyawa, Kenya; Toscani in Italy; Gujarati Indians in Houston; Chinese in metropolitan Denver; people of Mexican ancestry in Los Angeles; and people of African ancestry in the southwestern United States. The goal Project is to find most genetic variants that have frequencies of at least 1% in the populations studied. Sequencing is still too expensive to deeply sequence the many samples being studied for this project. However, any particular region of the genome generally contains a limited number of haplotypes. Data can be combined across many samples to allow efficient detection of most of the variants in a region. The Project currently plans to sequence each sample to about 4X coverage; at this depth sequencing cannot provide the complete genotype of each sample, but should allow the detection of most variants with frequencies as low as 1%. Combining the data from 2500 samples should allow highly accurate estimation (imputation) of the variants and genotypes for each sample that were not seen directly by the light sequencing. All samples from the 1000 genomes are available as lymphoblastoid cell lines (LCLs) and LCL derived DNA from the Coriell Cell Repository as part of the NHGRI Catalog. The sequence and alignment data generated by the 1000genomes project is made available as quickly as possible via their mirrored ftp sites. ftp://ftp.1000genomes.ebi.ac.uk ftp://ftp-trace.ncbi.nlm.nih.gov/1000genomes


Keywords

genetic variation, gene, next-generation sequencing, sequence, alignment, genome, single-nucleotide polymorphism, structural variant, haplotype, genome-wide association study, pharmacology, genetics, biomarker, consortium, data sharing, genotype, phenotype

Resource ID

SCR_006828

Alternate IDs

nlx_143819, OMICS_00261

Website Status

Last checked up

Abbreviation(s)

1000 Genomes

Species

human

Resource Type

Resource, organization portal, database, consortium, data set, portal, data or information resource

Funding Information

Wellcome Trust Sanger Institute; Hinxton; United Kingdom, Beijing Genomics Institute; Shenzhen; China, NHGRI, 454 Life Sciences Roche, Life Technologies, Illumina

Used By

BioSample Database at EBI

Uses

NHGRI Sample Repository for Human Genetic Research

Availability

Free, Public, Restrictions apply, Http://www.1000genomes.org/data#DataAccess

Synonym(s)

International 1000 Genomes Project, 1000 Genomes Project

Proper citation

1000 Genomes: A Deep Catalog of Human Genetic Variation, RRID:SCR_006828

Other resources frequently mentioned in the literature with this resource

SciCrunch Registry

Interactive portal for finding and submitting biomedical resources. Resources within SciCrunch are assigned RRIDs which are used to cite resources in scientific manuscripts.

Co-mentions heatmap

Load a heatmap of the top 20 resources that share the most co-mentions with this resource in the literature.


  1. Information

    Information on this specific resource.

  2. Relationships

    See other resources that this resources is related to.

  3. References

    Publications describing this resource.

  4. Referenced by

    Publications that reference this resource. These references are discovered by human submissions and automated crawling through various journals.

  5. Analytics

    Search for other resources that are referenced by publications that reference this resource.

  6. Data

    This resource is also a data repository used by SciCrunch. Search through the data.

  7. Data Licenses

    The licenses the data is under.

  8. Source

    The data repository this resource is listed from.