Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Comparative analysis reveals signatures of differentiation amid genomic polymorphism in Lake Malawi cichlids.

Genome biology | 2008

Cichlid fish from East Africa are remarkable for phenotypic and behavioral diversity on a backdrop of genomic similarity. In 2006, the Joint Genome Institute completed low coverage survey sequencing of the genomes of five phenotypically and ecologically diverse Lake Malawi species. We report a computational and comparative analysis of these data that provides insight into the mechanisms that make closely related species different from one another.

Pubmed ID: 18616806 RIS Download

Research resources used in this publication

None found

Antibodies used in this publication

None found

Associated grants

  • Agency: NIDCR NIH HHS, United States
    Id: R21 DE017182

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


Gene Index Project (tool)

RRID:SCR_002148

THIS RESOURCE IS NO LONGER IN SERVICE, documented May 10, 2017. A pilot effort that has developed a centralized, web-based biospecimen locator that presents biospecimens collected and stored at participating Arizona hospitals and biospecimen banks, which are available for acquisition and use by researchers. Researchers may use this site to browse, search and request biospecimens to use in qualified studies. The development of the ABL was guided by the Arizona Biospecimen Consortium (ABC), a consortium of hospitals and medical centers in the Phoenix area, and is now being piloted by this Consortium under the direction of ABRC. You may browse by type (cells, fluid, molecular, tissue) or disease. Common data elements decided by the ABC Standards Committee, based on data elements on the National Cancer Institute''s (NCI''s) Common Biorepository Model (CBM), are displayed. These describe the minimum set of data elements that the NCI determined were most important for a researcher to see about a biospecimen. The ABL currently does not display information on whether or not clinical data is available to accompany the biospecimens. However, a requester has the ability to solicit clinical data in the request. Once a request is approved, the biospecimen provider will contact the requester to discuss the request (and the requester''s questions) before finalizing the invoice and shipment. The ABL is available to the public to browse. In order to request biospecimens from the ABL, the researcher will be required to submit the requested required information. Upon submission of the information, shipment of the requested biospecimen(s) will be dependent on the scientific and institutional review approval. Account required. Registration is open to everyone.. Documented on August 19,2019.The goal of The Gene Index Project is to use the available Expressed Sequence Transcript (EST) and gene sequences, along with the reference genomes wherever available, to provide an inventory of likely genes and their variants and to annotate these with information regarding the functional roles played by these genes and their products. The promise of genome projects has been a complete catalog of genes in a wide range of organisms. While genome projects have been successful in providing reference genome sequences, the problem of finding genes and their variants in genomic sequence remains an ongoing challenge. TGI has created an inventory that contains genes and their variants together with description. In addition, this resource is attempting to use these catalogs to find links between genes and pathways in different species and to provide lists of features within completed genomes that can aid in the understanding of how gene expression is regulated. DATABASES *Eukaryotic Gene Orthologues (formerly known as TOGA - TIGR Orthologous Gene Alignment): Eukaryotic Gene Orthologues (EGO) at DFGI are generated by pair-wise comparison between the Tentative Consensus (TC) sequences that comprise the Dana Farber Gene Indices from individual organisms. The reciprocal pairs of the best match were clustered into individual groups and multiple sequence alignments were displayed for each group. *GeneChip Oncology Database (GCOD):Cancer gene expression database is a collection of publicly available microarray expression data on Affymetrix GeneChip Arrays related to human cancers. Currently only datasets with available raw data (Affymetrix .CEL files) are processed. All processed datasets were subjected to extensive manual curation, uniform processing and consistent quality control. You can browse the experiments in our collection, perform statistical analysis, and download processed data; or to search gene expression profiles using Entrez gene symbol, Unigene ID, or Affymetrix probeset ID. *Gene Indices: As of July 1, 2008, there are 111 publicly available gene indices. They are separated into 4 categories for better organization and easier access. Animal: 41, Plant: 45, Protist: 15, Fungal: 10 *Genomic Maps: Human, mouse, rat, chicken, drosophila melanogaster, zebrafish, mosquito, caenorhabditis elegans, Arabidopsis thaliana, rice, yeast, fission yeast Dana-Farber Cancer Institute (DFCI) Gene Indices Software Tools: *TGI Clustering tools (TGICL): a software system for fast clustering of large EST datasets. *GICL: this package contains the scripts and all the necessary pre-compiled binaries for 32bit Linux systems. *clview: an assembly file viewer. *SeqClean:a script for automated trimming and validation of ESTs or other DNA sequences by screening for various contaminants, low quality and low-complexity sequences. *cdbfasta/cdbyank: fast indexing/retrieval of fasta records from flat file databases. *DAS/XML Genomic Viewer The Genomic viewer borrows modules from http://www.biodas.org (lstein (at) cshl.org) & http://webreference.com.

View all literature mentions

BioPerl (tool)

RRID:SCR_002989

BioPerl is a community effort to produce Perl code which is useful in biology. This toolkit of perl modules is useful in building bioinformatics solutions in Perl. It is built in an object-oriented manner so that many modules depend on each other to achieve a task. The collection of modules in the bioperl-live repository consist of the core of the functionality of bioperl. Additionally auxiliary modules for creating graphical interfaces (bioperl-gui), persistent storage in RDMBS (bioperl-db), running and parsing the results from hundreds of bioinformatics applications (Run package), software to automate bioinformatic analyses (bioperl-pipeline) are all available as Git modules in our repository. The BioPerl toolkit provides a library of hundreds of routines for processing sequence, annotation, alignment, and sequence analysis reports. It often serves as a bridge between different computational biology applications assisting the user to construct analysis pipelines. This chapter illustrates how BioPerl facilitates tasks such as writing scripts summarizing information from BLAST reports or extracting key annotation details from a GenBank sequence record. BioPerl includes modules written by Sohel Merchant of the GO Consortium for parsing and manipulating OBO ontologies. Platform: Windows compatible, Mac OS X compatible, Linux compatible, Unix compatible

View all literature mentions

STRUCTURE (tool)

RRID:SCR_002151

Software package for using multi locus genotype data to investigate population structure. Used for inferring presence of distinct populations, assigning individuals to populations, studying hybrid zones, identifying migrants and admixed individuals, and estimating population allele frequencies in situations where many individuals are migrants or admixed. Can be applied to most of commonly used genetic markers, including SNPS, microsatellites, RFLPs and Amplified Fragment Length Polymorphisms.

View all literature mentions

RepeatMasker (tool)

RRID:SCR_012954

Software tool that screens DNA sequences for interspersed repeats and low complexity DNA sequences. The output of the program is a detailed annotation of the repeats that are present in the query sequence as well as a modified version of the query sequence in which all the annotated repeats have been masked (default: replaced by Ns). Currently over 56% of human genomic sequence is identified and masked by the program. Sequence comparisons in RepeatMasker are performed by one of several popular search engines including nhmmer, cross_match, ABBlast/WUBlast, RMBlast and Decypher. RepeatMasker makes use of curated libraries of repeats and currently supports Dfam ( profile HMM library ) and RepBase ( consensus sequence library ).

View all literature mentions

Trace Archive (tool)

RRID:SCR_013788

An online repository which houses sequencing data from gel and capillary platforms (such as Applied Biosystems ABI 3730®). Most sequences are derived from Whole Genome Shotgun sequencing. Large data sets as well as only a few sequences can be obtained.

View all literature mentions