2024MAY10: Our hosting provider is experiencing intermittent networking issues. We apologize for any inconvenience.

Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Dense sampling of bird diversity increases power of comparative genomics.

Shaohong Feng | Josefin Stiller | Yuan Deng | Joel Armstrong | Qi Fang | Andrew Hart Reeve | Duo Xie | Guangji Chen | Chunxue Guo | Brant C Faircloth | Bent Petersen | Zongji Wang | Qi Zhou | Mark Diekhans | Wanjun Chen | Sergio Andreu-Sánchez | Ashot Margaryan | Jason Travis Howard | Carole Parent | George Pacheco | Mikkel-Holger S Sinding | Lara Puetz | Emily Cavill | Ângela M Ribeiro | Leopold Eckhart | Jon Fjeldså | Peter A Hosner | Robb T Brumfield | Les Christidis | Mads F Bertelsen | Thomas Sicheritz-Ponten | Dieter Thomas Tietze | Bruce C Robertson | Gang Song | Gerald Borgia | Santiago Claramunt | Irby J Lovette | Saul J Cowen | Peter Njoroge | John Philip Dumbacher | Oliver A Ryder | Jérôme Fuchs | Michael Bunce | David W Burt | Joel Cracraft | Guanliang Meng | Shannon J Hackett | Peter G Ryan | Knud Andreas Jønsson | Ian G Jamieson | Rute R da Fonseca | Edward L Braun | Peter Houde | Siavash Mirarab | Alexander Suh | Bengt Hansson | Suvi Ponnikas | Hanna Sigeman | Martin Stervander | Paul B Frandsen | Henriette van der Zwan | Rencia van der Sluis | Carina Visser | Christopher N Balakrishnan | Andrew G Clark | John W Fitzpatrick | Reed Bowman | Nancy Chen | Alison Cloutier | Timothy B Sackton | Scott V Edwards | Dustin J Foote | Subir B Shakya | Frederick H Sheldon | Alain Vignal | André E R Soares | Beth Shapiro | Jacob González-Solís | Joan Ferrer-Obiol | Julio Rozas | Marta Riutort | Anna Tigano | Vicki Friesen | Love Dalén | Araxi O Urrutia | Tamás Székely | Yang Liu | Michael G Campana | André Corvelo | Robert C Fleischer | Kim M Rutherford | Neil J Gemmell | Nicolas Dussex | Henrik Mouritsen | Nadine Thiele | Kira Delmore | Miriam Liedvogel | Andre Franke | Marc P Hoeppner | Oliver Krone | Adam M Fudickar | Borja Milá | Ellen D Ketterson | Andrew Eric Fidler | Guillermo Friis | Ángela M Parody-Merino | Phil F Battley | Murray P Cox | Nicholas Costa Barroso Lima | Francisco Prosdocimi | Thomas Lee Parchman | Barney A Schlinger | Bette A Loiselle | John G Blake | Haw Chuan Lim | Lainy B Day | Matthew J Fuxjager | Maude W Baldwin | Michael J Braun | Morgan Wirthlin | Rebecca B Dikow | T Brandt Ryder | Glauco Camenisch | Lukas F Keller | Jeffrey M DaCosta | Mark E Hauber | Matthew I M Louder | Christopher C Witt | Jimmy A McGuire | Joann Mudge | Libby C Megna | Matthew D Carling | Biao Wang | Scott A Taylor | Glaucia Del-Rio | Alexandre Aleixo | Ana Tereza Ribeiro Vasconcelos | Claudio V Mello | Jason T Weir | David Haussler | Qiye Li | Huanming Yang | Jian Wang | Fumin Lei | Carsten Rahbek | M Thomas P Gilbert | Gary R Graves | Erich D Jarvis | Benedict Paten | Guojie Zhang
Nature | 2020

Whole-genome sequencing projects are increasingly populating the tree of life and characterizing biodiversity1-4. Sparse taxon sampling has previously been proposed to confound phylogenetic inference5, and captures only a fraction of the genomic diversity. Here we report a substantial step towards the dense representation of avian phylogenetic and molecular diversity, by analysing 363 genomes from 92.4% of bird families-including 267 newly sequenced genomes produced for phase II of the Bird 10,000 Genomes (B10K) Project. We use this comparative genome dataset in combination with a pipeline that leverages a reference-free whole-genome alignment to identify orthologous regions in greater numbers than has previously been possible and to recognize genomic novelties in particular bird lineages. The densely sampled alignment provides a single-base-pair map of selection, has more than doubled the fraction of bases that are confidently predicted to be under conservation and reveals extensive patterns of weak selection in predominantly non-coding DNA. Our results demonstrate that increasing the diversity of genomes used in comparative studies can reveal more shared and lineage-specific variation, and improve the investigation of genomic characteristics. We anticipate that this genomic resource will offer new perspectives on evolutionary processes in cross-species comparative analyses and assist in efforts to conserve species.

Pubmed ID: 33177665 RIS Download

Associated grants

  • Agency: NHLBI NIH HHS, United States
    Id: U01 HL137183
  • Agency: Howard Hughes Medical Institute, United States
  • Agency: NHGRI NIH HHS, United States
    Id: T32 HG008345
  • Agency: NHGRI NIH HHS, United States
    Id: R01 HG010053
  • Agency: NHGRI NIH HHS, United States
    Id: R01 HG010485
  • Agency: NHGRI NIH HHS, United States
    Id: U54 HG007990
  • Agency: NIGMS NIH HHS, United States
    Id: R35 GM133412

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


RefSeq (tool)

RRID:SCR_003496

Collection of curated, non-redundant genomic DNA, transcript RNA, and protein sequences produced by NCBI. Provides a reference for genome annotation, gene identification and characterization, mutation and polymorphism analysis, expression studies, and comparative analyses. Accessed through the Nucleotide and Protein databases.

View all literature mentions

NONCODE (tool)

RRID:SCR_007822

Collection of non-coding RNAs (excluding tRNAs and rRNAs) as an integrated knowledge database. Used to get text information such as class,name,location,related publication,mechanism through which it exerts its function, view figures which show their location in the genome or in a specific DNA fragment, and the regulation elements flanking the ncRNA gene sequences.

View all literature mentions

ALLPATHS-LG (tool)

RRID:SCR_010742

Software tool as whole genome shotgun assembler that can generate high quality genome assemblies using short reads (~100bp) such as those produced by the new generation of sequencers.

View all literature mentions

SOAPdenovo (tool)

RRID:SCR_010752

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on February 24,2023. Software tool for de novo assembly of human genomes with massively parallel short read sequencing.Short-read assembly method that can build de novo draft assembly for human sized genomes.Software package for assembling short oligonucleotide into contigs and scaffolds.

View all literature mentions

RepeatMasker (tool)

RRID:SCR_012954

Software tool that screens DNA sequences for interspersed repeats and low complexity DNA sequences. The output of the program is a detailed annotation of the repeats that are present in the query sequence as well as a modified version of the query sequence in which all the annotated repeats have been masked (default: replaced by Ns). Currently over 56% of human genomic sequence is identified and masked by the program. Sequence comparisons in RepeatMasker are performed by one of several popular search engines including nhmmer, cross_match, ABBlast/WUBlast, RMBlast and Decypher. RepeatMasker makes use of curated libraries of repeats and currently supports Dfam ( profile HMM library ) and RepBase ( consensus sequence library ).

View all literature mentions

BUSCO (tool)

RRID:SCR_015008

Software tool to quantitatively measure genome assembly and annotation completeness based on evolutionarily informed expectations of gene content.

View all literature mentions

RepeatModeler (tool)

RRID:SCR_015027

Sequence analysis software that performs repeat family identification and creates models for sequence data. RepeatModeler utilizes RepeatScout and RECON to identify repeat element boundaries and family relationships.

View all literature mentions

GeneWise (tool)

RRID:SCR_015054

Gene alignment tool from the EBI which predicts gene structure using similar protein sequences. See also the associated GenomeWise tool.

View all literature mentions

Conservation (tool)

RRID:SCR_016064

Software for scoring protein sequence conservation using the Jensen-Shannon divergence. It can be used to predict catalytic sites and residues near bound ligands.

View all literature mentions

Examl (tool)

RRID:SCR_016087

Source code for large-scale phylogenetic analyses on whole-transcriptome and whole-genome alignments using supercomputers.

View all literature mentions

UniProt (tool)

RRID:SCR_002380

Collection of data of protein sequence and functional information. Resource for protein sequence and annotation data. Consortium for preservation of the UniProt databases: UniProt Knowledgebase (UniProtKB), UniProt Reference Clusters (UniRef), and UniProt Archive (UniParc), UniProt Proteomes. Collaboration between European Bioinformatics Institute (EMBL-EBI), SIB Swiss Institute of Bioinformatics and Protein Information Resource. Swiss-Prot is a curated subset of UniProtKB.

View all literature mentions

UCSC Genome Browser (tool)

RRID:SCR_005780

Portal to interactively visualize genomic data. Provides reference sequences and working draft assemblies for collection of genomes and access to ENCODE and Neanderthal projects. Includes collection of vertebrate and model organism assemblies and annotations, along with suite of tools for viewing, analyzing and downloading data.

View all literature mentions

NCBI (tool)

RRID:SCR_006472

A portal to biomedical and genomic information. NCBI creates public databases, conducts research in computational biology, develops software tools for analyzing genome data, and disseminates biomedical information for the better understanding of molecular processes affecting human health and disease.

View all literature mentions