Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Parallel and Intertwining Threads of Domestication in Allopolyploid Cotton.

Advanced science (Weinheim, Baden-Wurttemberg, Germany) | 2021

The two cultivated allopolyploid cottons, Gossypium hirsutum and Gossypium barbadense, represent a remarkable example of parallel independent domestication, both involving dramatic morphological transformations under selection from wild perennial plants to annualized row crops. Deep resequencing of 643 newly sampled accessions spanning the wild-to-domesticated continuum of both species, and their allopolyploid relatives, are combined with existing data to resolve species relationships and elucidate multiple aspects of their parallel domestication. It is confirmed that wild G. hirsutum and G. barbadense were initially domesticated in the Yucatan Peninsula and NW South America, respectively, and subsequently spread under domestication over 4000-8000 years to encompass most of the American tropics. A robust phylogenomic analysis of infraspecific relationships in each species is presented, quantify genetic diversity in both, and describe genetic bottlenecks associated with domestication and subsequent diffusion. As these species became sympatric over the last several millennia, pervasive genome-wide bidirectional introgression occurred, often with striking asymmetries involving the two co-resident genomes of these allopolyploids. Diversity scans revealed genomic regions and genes unknowingly targeted during domestication and additional subgenomic asymmetries. These analyses provide a comprehensive depiction of the origin, divergence, and adaptation of cotton, and serve as a rich resource for cotton improvement.

Pubmed ID: 34026441 RIS Download

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


VCFtools (tool)

RRID:SCR_001235

Software package for working with VCF files. Used to provide easily accessible methods for working with complex genetic variation data in the form of VCF files.Implements various utilities for processing Variant Call Format files, including validation, merging, comparing. Provides general Perl API.

View all literature mentions

GATK (tool)

RRID:SCR_001876

A software package to analyze next-generation resequencing data. The toolkit offers a wide variety of tools, with a primary focus on variant discovery and genotyping as well as strong emphasis on data quality assurance. Its robust architecture, powerful processing engine and high-performance computing features make it capable of taking on projects of any size. This software library makes writing efficient analysis tools using next-generation sequencing data very easy, and second it's a suite of tools for working with human medical resequencing projects such as 1000 Genomes and The Cancer Genome Atlas. These tools include things like a depth of coverage analyzers, a quality score recalibrator, a SNP/indel caller and a local realigner. (entry from Genetic Analysis Software)

View all literature mentions

SAMTOOLS (tool)

RRID:SCR_002105

Original SAMTOOLS package has been split into three separate repositories including Samtools, BCFtools and HTSlib. Samtools for manipulating next generation sequencing data used for reading, writing, editing, indexing,viewing nucleotide alignments in SAM,BAM,CRAM format. BCFtools used for reading, writing BCF2,VCF, gVCF files and calling, filtering, summarising SNP and short indel sequence variants. HTSlib used for reading, writing high throughput sequencing data.

View all literature mentions

STRUCTURE (tool)

RRID:SCR_002151

Software package for using multi locus genotype data to investigate population structure. Used for inferring presence of distinct populations, assigning individuals to populations, studying hybrid zones, identifying migrants and admixed individuals, and estimating population allele frequencies in situations where many individuals are migrants or admixed. Can be applied to most of commonly used genetic markers, including SNPS, microsatellites, RFLPs and Amplified Fragment Length Polymorphisms.

View all literature mentions

STAR (tool)

RRID:SCR_004463

Software performing alignment of high-throughput RNA-seq data. Aligns RNA-seq reads to reference genome using uncompressed suffix arrays.

View all literature mentions

Eigensoft (tool)

RRID:SCR_004965

EIGENSOFT package combines functionality from our population genetics methods (Patterson et al. 2006) and our EIGENSTRAT stratification method (Price et al. 2006). The EIGENSTRAT method uses principal components analysis to explicitly model ancestry differences between cases and controls along continuous axes of variation; the resulting correction is specific to a candidate marker''s variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. The EIGENSOFT package has a built-in plotting script and supports multiple file formats and quantitative phenotypes. Source code, documentation and executables for using EIGENSOFT 3.0 on a Linux platform can be downloaded. New features of EIGENSOFT 3.0 include supporting either 32-bit or 64-bit Linux machines, a utility to merge different data sets, a utility to identify related samples (accounting for population structure), and supporting multiple file formats for EIGENSTRAT stratification correction.

View all literature mentions

SnpEff (tool)

RRID:SCR_005191

Genetic variant annotation and effect prediction software toolbox that annotates and predicts effects of variants on genes (such as amino acid changes). By using standards, such as VCF, SnpEff makes it easy to integrate with other programs.

View all literature mentions

RAxML (tool)

RRID:SCR_006086

Software program for phylogenetic analyses of large datasets under maximum likelihood.

View all literature mentions

Picard (tool)

RRID:SCR_006525

Java toolset for working with next generation sequencing data in the BAM format.

View all literature mentions

BEDTools (tool)

RRID:SCR_006646

A powerful toolset for genome arithmetic allowing one to address common genomics tasks such as finding feature overlaps and computing coverage. Bedtools allows one to intersect, merge, count, complement, and shuffle genomic intervals from multiple files in widely-used genomic file formats such as BAM, BED, GFF/GTF, VCF. While each individual tool is designed to do a relatively simple task (e.g., intersect two interval files), quite sophisticated analyses can be conducted by combining multiple bedtools operations on the UNIX command line.

View all literature mentions

FreeBayes (tool)

RRID:SCR_010761

A Bayesian genetic variant detector designed to find small polymorphisms, specifically SNPs, indels, MNPs, and complex events smaller than the length of a short-read sequencing alignment.

View all literature mentions

Cufflinks (tool)

RRID:SCR_014597

Software tool for transcriptome assembly and differential expression analysis for RNA-Seq. Includes script called cuffmerge that can be used to merge together several Cufflinks assemblies. It also handles running Cuffcompare as well as automatically filtering a number of transfrags that are likely to be artifacts. If the researcher has a reference GTF file, the researcher can provide it to the script to more effectively merge novel isoforms and maximize overall assembly quality.

View all literature mentions

SOAPnuke (tool)

RRID:SCR_015025

Multi-threaded software for rapid quality control and preprocessing of high throughput sequencing data specified for different experiments. It consists of four modules that speed up the report on statistics graphs of raw datasets, preprocessed datasets and preprocessing status.

View all literature mentions

Structure Harvester (tool)

RRID:SCR_017636

Web based program for collating results generated by program STRUCTURE. Provides assess and visualize likelihood values across multiple values of K and hundreds of iterations for easier detection of number of genetic groups that best fit data. Reformats data for use in downstream programs, such as CLUMPP.It is complement for using software Structure in genetics population. Website and program for visualizing STRUCTURE output and implementing Evanno method.

View all literature mentions

STRUCTURE (tool)

RRID:SCR_017637

Software package for using multi locus genotype data to investigate population structure. Used for inferring presence of distinct populations, assigning individuals to populations, studying hybrid zones, identifying migrants and admixed individuals, and estimating population allele frequencies in situations where many individuals are migrants or admixed. Can be applied to most of commonly used genetic markers, including SNPS, microsatellites, RFLPs and Amplified Fragment Length Polymorphisms.

View all literature mentions