Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

  • Register
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.


Leaving Community

Are you sure you want to leave this community? Leaving the community will revoke any permissions you have been granted in this community.

Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Preparing word cloud



Type in a keyword to search

Filter by last modified time
See new records

Current Facets and Filters

  • Keywords:genome (facet)
  • Website Status : Ascending


Sort alphabetically | Sort by count

Recent searches

SciCrunch Registry is a curated repository of scientific resources, with a focus on biomedical resources, including tools, databases, and core facilities - visit SciCrunch to register your resource.

(last updated: Sep 3, 2019)

Physical Resource or Software Tool Software

718 Results - per page

Resource NameResource TypeDescriptionKeywordsResource IDProper CitationParent OrganizationRelated ConditionFunding AgencyRelationReferenceWebsite StatusAlternate IDsAlternate URLsOld URLs
UCSCinResource, analysis service resource, data analysis service, service resource, production service resourceTHIS RESOURCE IS NO LONGER IN SERVCE, documented September 2, 2016. Beta software used to align and browse a genome.genomeSCR_000571(UCSCin, RRID:SCR_000571)University of Toronto; Ontario; Canada Last checked downnlx_144379
OmicsOffice for NGS SeqSolveResource, software resource, commercial organizationSoftware for secondary and tertiary analysis of Next Generation Sequencing (NGS) sequencing, rna-seq, chip-seq, transcript, alternative splicing, variant, mirna, non-coding rna expression, genome, differential expressionSCR_001222(OmicsOffice for NGS SeqSolve, RRID:SCR_001222)listed by: OMICtoolsPMID:20671709Last checked downOMICS_02111
Tribolium castaneum Genome ProjectResource, topical portal, data or information resource, portal, databaseThis portal provides information about the Tribolium castabeum Genome Project. The Tribolium castaneum genome sequence and its analysis has been published in Nature, two companion journal issues (IBMB and DGE) and numerous other publications listed below. The red flour beetle, Tribolium castaneum, a common pest that is also a genetic model for the Coleoptera. The genome has been sequenced to 7-fold coverage using a whole genome shotgun approach and assembled using the HGSC's assembly engine, Atlas, with methods employed for the Drosophila pseudoobscura genome assembly. Approximately 90% of the genome sequence has been mapped to chromosomes in collaboration with Dick Beeman (USDA ARS) and Sue Brown (Kansas State University). Access to the Data :- Genome Assembly: The long term home of the Tribolium genome is Beetlebase. Tcas 3.0 is now available in GenBank and on our FTP site. Note there are no restrictions of any kind on the Tribolium data as it has been published. Version 2 of the assembly, Tcas_2.0 is available for download using the FTP Data link in the sidebar. The assembly is described in detail in the README in that directory. T.cas_1.0 was a preliminary genome assembly that did not include large insert paired end information and has been moved to a previous assemblies folder. A genboree browser of the Tcas2.0 sequence is available here: There are also links to the genboree browser from the blast results (at the bottom of each reported HSP) if you use the blast server on this page. The original linear scaffold file, Tcas2.0/linearScaffolds/Tcas20050914-genome, posted on the ftp site did not include singleton contigs from the assembly and thus did not fully reflect the tribolium genome sequence, missing ~4.4Mb of sequence in 1860 contigs and reptigs or approximately 2.5% of the assembled sequence. A corrected Tcas20051011-genome file containing these missing sequences is now available on the ftp site. The blast databases have also been updated to reflect this change. All other data is correct, and not affected by this change. :- BLAST Searches: The BLAST link is located in the sidebar. :* Linearized chromosome and unplaced scaffold sequences :* Assembled contigs :* Bin0 unassembled reads and Repeat reads Traces are available from the NCBI Trace Archive by using the link in the sidebar, or by using NCBI MegaBLAST with a same species or cross species query. Sponsors: Funding for this project has been provided by the National Human Genome Research Institute (NHGRI U54 HG003273), which is part of the National Institutes of Health (NIH), and the U.S. Department of Agriculture's Agricultural Research Service (USDA ARS Agreement No. 58-5430-3-338).genetic, chromosome, coleoptera, drosophila, genome, model, pest, red flour beetle, sequence, tribolium castaneumSCR_002848(Tribolium castaneum Genome Project, RRID:SCR_002848)Baylor University; Texas; USA Last checked downnif-0000-25607
Honey Bee Genome ProjectResource, topical portal, portal, data or information resourceThe HGSC has sequenced the honey bee, Apis mellifera. The version 4.0 assembly was released in March 2006 and published in October 2006. The genome sequence is being upgraded with additional sequence coverage. The honey bee is important in the agricultural community as a producer of honey and as a facilitator of pollination. It is a model organism for studying the following human health issues: immunity, allergic reaction, antibiotic resistance, development, mental health, longevity and diseases of the X chromosome. In addition, biologists are interested in the honey bee's social organization and behavioral traits. This project was proposed to the HGSC by a group of dedicated insect biologists, headed by Gene Robinson. Following a workshop at the HGSC and a honey bee white paper, the HGSC began the project in 2002. A 6-fold coverage WGS, BAC sequence from pooled arrays, and an initial genome assembly (Amel_v1.0) were released beginning in 2003. This has been a challenging project with difficulty in recovering AT-rich regions. The WGS data had lower coverage in AT-rich regions and BAC data from clones showed evidence of internal deletions. Additional reads from AT enriched DNA addressed these underrepresented regions. The current assembly Amel_4.0 was produced with Atlas and includes 2.7 million reads (1.8 Gb) or 7.5x coverage of the (clonable) genome. About 97% of STSs, 98% of ESTs, and 96% of cDNAs are represented in the 231 Mb assembly. About 2,500 reads were also produced from a strain of Africanized honey bee and SNPs were extracted. These were released in dbSNP and the NCBI Trace Archive. Analysis of the genome by a consortium of 20 labs has been completed. This produced a gene list derived from five different methods melded through the GLEAN software. Publications include a main paper in Nature and up to forty companion papers in Genome Research and Insect Molecular Biology. Sponsors: Sequencing of the honey bee is jointly funded by National Human Genome Research Institute (NHGRI) and the Department of Agriculture (USDA). Multiple drones from the same queen (strain DH4) were obtained from Danny Weaver of B. Weaver Apiaries. All libraries were made from DNA isolated from these drones. The honey bee BAC library (CHORI-224) was prepared by Pieter de Jong and Katzutoyo Osoegawa at the Children's Hospital Oakland Research Institute.gene, agricultural, allergy, antibiotic, apis mellifera, array, behavioral, biologist, chromosome, development, disease, genome, heath, honey bee, human, immunity, insect, mental heath, organism, pollination, reaction, resistance, sequence, traitSCR_002890(Honey Bee Genome Project, RRID:SCR_002890)Baylor University; Texas; USA Last checked downnif-0000-25604
genomationResource, software resource, software toolkitR package that contains a collection of tools for simplfiying common tasks in genomic feature analysis. It provides functions for reading BED and GFF files as GRanges objects, summarizing genomic features over predefined windows so users can make average enrichment of features over defined regions or produce heatmaps. It can also annotate given regions with other genomic features such as exons,introns and promoters.genome, genomic interval, rSCR_003435(genomation, RRID:SCR_003435)listed by: OMICtoolsLast checked downOMICS_02306
GOtchaResource, analysis service resource, data analysis service, service resource, production service resourceGOtcha provides a prediction of a set of GO terms that can be associated with a given query sequence. Each term is scored independently and the scores calibrated against reference searches to give an accurate percentage likelihood of correctness. These results can be displayed graphically. Why is GOtcha different to what is already out there and why should you be using it? * GOtcha uses a method where it combines information from many search hits, up to and including E-values that are normally discarded. This gives much better sensitivity than other methods. * GOtcha provides a score for each individual term, not just the leaf term or branch. This allows the discrimination between confident assignments that one would find at a more general level and the more specific terms that one would have lower confidence in. * The scores GOtcha provides are calibrated to give a real estimate of correctness. This is expressed as a percentage, giving a result that non-experts are comfortable in interpreting. * GOtcha provides graphical output that gives an overview of the confidence in, or potential alternatives for, particular GO term assignments. The tool is currently web-based; contact David Martin for details of the standalone version. Platform: Online toolfunction, protein, prediction, genome, annotation, gene, statistical analysisSCR_005790(GOtcha, RRID:SCR_005790)University of Dundee; Scotland; United Kingdom European Union fifth framework, Wellcome Trustrelated to: Gene Ontology, listed by: Gene Ontology ToolsPMID:15550167Last checked downnlx_149269
OligoGenome Resource, data or information resource, resource, databaseThe Stanford Human OligoGenome Project hosts a database of capture oligonucleotides for conducting high-throughput targeted resequencing of the human genome. This set of capture oligonucleotides covers over 92% of the human genome for build 37 / hg19 and over 99% of the coding regions defined by the Consensus Coding Sequence (CCDS). The capture reaction uses a highly multiplexed approach for selectively circularizing and capturing multiple genomic regions using the in-solution method developed in Natsoulis et al, PLoS One 2011. Combined pools of capture oligonucleotides selectively circularize the genomic DNA target, followed by specific PCR amplification of regions of interest using a universal primer pair common to all of the capture oligonucleotides. Unlike multiplexed PCR methods, selective genomic circularization is capable of efficiently amplifying hundreds of genomic regions simultaneously in multiplex without requiring extensive PCR optimization or producing unwanted side reaction products. Benefits of the selective genomic circularization method are the relative robustness of the technique and low costs of synthesizing standard capture oligonucleotide for selecting genomic targets.oligonucleotide, genome, probe, coding region, oligonucleotide sequence, chromosomeSCR_006025( OligoGenome , RRID:SCR_006025)Stanford University; Stanford; California Doris Duke Clinical Foundation, Howard Hughes Medical Foundation, Liu Bie Ju Cha and Family Fellowship in Cancer, NCI, NHGRI, NIDDK, NLM, Reddere Foundation, Wang Family FoundationPMID:22102592Last checked downnlx_151422
CAGEResource, software resourceExpression profiling and promoter identification software tool for transcriptional network analysis and transcriptome characterization. DeepCAGE, the combination of next-generation sequencing with next generation expression profiling provides unsurpassed solutions for expression profiling and genome annotation. CAGE will be the experimental approach at need to link gene expression and control regions in the genome. With the availability of next-generation sequencing methods, DNAFORM now offers DeepCAGE services. DeepCAGE libraries are prepared for direct analysis by an Illumina/Solexa Sequencer. One sequencing run using one channel on an Illumina/Solexa Sequencer can yield in over 4,000,000 reads per sample. CAGE is based on our full-length cDNA library technology, where an adaptor is ligated to the 5''''-end of full-length cDNAs, which introduces a recognition site for a Class IIs restriction endonuclease adjacent to the 5''''-end of the cDNA. The Class IIs restriction endonuclease, here MmeI, allows for the cloning of short tags as derived from the 5''''-end of transcripts into concatemers for high-throughput sequencing. CAGE tags are further characterized by mapping to genomic sequences, which enables the identification of transcriptional start sites. As such CAGE can contribute to projects in Gene Discovery, Gene Expression, and Promoter Identification. After the genome sequencing projects have provided us with the genetic blueprints for many organisms, new questions have to be answered on how to correlate the observed genotypes with related phenotypes, and how to understand the regulation of genetic information in time and space. The dynamics of living systems and the functional behavior of cells in multicellular organisms has thus become the subject of the emerging field of system biology. Integration of experimental approaches and computer aided theories on a system level will be the fundamental principle to drive systems biology in order to understand the principles behind complex regulatory networks, which will be an ambitious goal requiring new approaches in life sciences. For ordering and additional information, please contact us under contact_at_dnaform.jpgene, gene expression, transcriptome, expression profiling, promoter identification, promoter, gene discovery, cdna, cdna library, genome annotation, genome, annotation, rna library, rna, mrnaSCR_007574(CAGE, RRID:SCR_007574)RIKEN related to: FANTOM DB, CAGE Basic Viewer for Mus musculusLast checked downnif-0000-02631
EdgeBioResource, service resource, commercial organization, core facility, access service resourceA contract research organization that provides genomics services such as sequencing, bioinformatics, NGS data analysis and whole exome sequencing. EdgeBio is a CLIA-approved service provider.contract research organization, CRO, genomics, genome, sequencing, bioinformatics, NGS data analysis, whole exome sequencing, research, Illumina NGSSCR_000183(EdgeBio, RRID:SCR_000183)listed by: Science ExchangeLast checked downSciEx_203
UnSplicerResource, software resourceAn RNA-seq alignment program that provides alignment of short reads to a reference genome. The program requires two inputs that are provided by the output of GeneMark-ES: HMM model parameters and ab initio gene predictions. UnSplicer is a sister pipeline to TrueSight.RNA, sequencing, alignment, short reads, genome, genemark-es, gene predictionSCR_000226(UnSplicer, RRID:SCR_000226)Georgia Institute of Technology; Georgia; USA listed by: OMICtoolsPMID:24259430Last checked downOMICS_01806
Alternative Splicing Annotation Project II DatabaseResource, data or information resource, databaseTHIS RESOURCE IS NO LONGER IN SERVICE, documented on 8/12/13. An expanded version of the Alternative Splicing Annotation Project (ASAP) database with a new interface and integration of comparative features using UCSC BLASTZ multiple alignments. It supports 9 vertebrate species, 4 insects, and nematodes, and provides with extensive alternative splicing analysis and their splicing variants. As for human alternative splicing data, newly added EST libraries were classified and included into previous tissue and cancer classification, and lists of tissue and cancer (normal) specific alternatively spliced genes are re-calculated and updated. They have created a novel orthologous exon and intron databases and their splice variants based on multiple alignment among several species. These orthologous exon and intron database can give more comprehensive homologous gene information than protein similarity based method. Furthermore, splice junction and exon identity among species can be valuable resources to elucidate species-specific genes. ASAP II database can be easily integrated with pygr (unpublished, the Python Graph Database Framework for Bioinformatics) and its powerful features such as graph query, multi-genome alignment query and etc. ASAP II can be searched by several different criteria such as gene symbol, gene name and ID (UniGene, GenBank etc.). The web interface provides 7 different kinds of views: (I) user query, UniGene annotation, orthologous genes and genome browsers; (II) genome alignment; (III) exons and orthologous exons; (IV) introns and orthologous introns; (V) alternative splicing; (IV) isoform and protein sequences; (VII) tissue and cancer vs. normal specificity. ASAP II shows genome alignments of isoforms, exons, and introns in UCSC-like genome browser. All alternative splicing relationships with supporting evidence information, types of alternative splicing patterns, and inclusion rate for skipped exons are listed in separate tables. Users can also search human data for tissue- and cancer-specific splice forms at the bottom of the gene summary page. The p-values for tissue-specificity as log-odds (LOD) scores, and highlight the results for LOD >= 3 and at least 3 EST sequences are all also reported.exon, gene structure, genome, alternative splicing, cancer genome alignment, intron, isoform, orthologous exon, orthologous gene, orthologous intron, protein sequence, splice site, tissue, genome alignment, cancerSCR_000322(Alternative Splicing Annotation Project II Database, RRID:SCR_000322)University of California at Los Angeles; California; USA NCRR, NIDCRrelated to: ASAP: the Alternative Splicing Annotation ProjectPMID:17108355Last checked downnif-0000-02572
Non-Human Genome Segmental Duplication DatabaseResource, data or information resource, databaseTHIS RESOURCE IS NO LONGER IN SERVICE, documented on July 16, 2013. It contains information about segmental duplications in the genomes of chimpanzee, mouse, and rat. The criteria used to identify regions of segmental duplication are: * Sequence identity of at least 90% * Sequence length of at least 5 kb * Not be entirely composed of repetitive elements. BACKGROUND: The high quality of the mouse genome draft sequence and its associated annotations are an invaluable biological resource. Identifying recent duplications in the mouse genome, especially in regions containing genes, may highlight important events in recent murine evolution. In addition, detecting recent sequence duplications can reveal potentially problematic regions of the genome assembly. We use BLAST-based computational heuristics to identify large (>/= 5 kb) and recent (>/= 90% sequence identity) segmental duplications in the mouse genome sequence. Here we present a database of recently duplicated regions of the mouse genome found in the mouse genome sequencing consortium (MGSC) February 2002 and February 2003 assemblies. RESULTS: We determined that 33.6 Mb of 2,695 Mb (1.2%) of sequence from the February 2003 mouse genome sequence assembly is involved in recent segmental duplications, which is less than that observed in the human genome (around 3.5-5%). From this dataset, 8.9 Mb (26%) of the duplication content consisted of "unmapped" chromosome sequence. Moreover, we suspect that an additional 18.5 Mb of sequence is involved in duplication artifacts arising from sequence misassignment errors in this genome assembly. By searching for genes that are located within these regions, we identified 675 genes that mapped to duplicated regions of the mouse genome. Sixteen of these genes appear to have been duplicated independently in the human genome. From our dataset we further characterized a 42 kb recent segmental duplication of Mater, a maternal-effect gene essential for embryogenesis in mice. CONCLUSION: Our results provide an initial analysis of the recently duplicated sequence and gene content of the mouse genome. Many of these duplicated loci, as well as regions identified to be involved in potential sequence misassignment errors, will require further mapping and sequencing to achieve accuracy. A Genome Browser database was set up to display the identified duplication content presented in this work. This data will also be relevant to the growing number of investigators who use the draft genome sequence for experimental design and analysis. The segmental duplication data and summary statistics are available for download and can also be visualized in a genome browser in the GBrowse section. Selected annotation tracks (except the segmental duplication track) have also been obtained from UCSC and loaded into the genome browser. Detailed information (e.g. overlapping genes, overlapping clones, detailed alignment) can be obtained by clicking on a duplication cluster in GBrowse. Both keyword search and BLAT search are available. Analyses based on previous genome assemblies can be found in the Previous Analyses section. Recent Developments The Non-Human Genome Segmental Duplication Database is continually updated including the archived copies of the analysis of all previous genome assemblies and will include all new species as they become available. Acknowledgments We thank The Centre for Applied Genomics at the Hospital for Sick Children (HSC) as well as collaborators worldwide. Supported by Genome Canada the Howard Hughes Medical Institute International Scholar Program (to S.W.S.) and the HSC Foundation.genes, genome, chicken, chimpanzee, chromosome, dna, dog, mouse, rat, segmental duplicationsSCR_000470(Non-Human Genome Segmental Duplication Database, RRID:SCR_000470)Last checked downnif-0000-03194
Opal ResearchResource, software resource, data processing software, data analysis software, sequence analysis software, software applicationSoftware which integrates a comprehensive, automated genome annotation engine with the VAAST and Phevor disease gene prioritization tools to rank gene variants on the severity of their impact on protein function and likelihood to cause disease. Each variant in a gene is analyzed for its impact on protein function, conservation and frequency. Each gene is ranked rather than filtered in order to ensure critical targets are not prematurely removed.sequence analysis software, genome interpretation, variant prioritization, disease gene prioritization, next-generation sequencing, clinical interpretation, clinical genomics software, genome, protein function, disease, genomic variant, mutationSCR_000405(Opal Research, RRID:SCR_000405)related to: VAASTPMID:23895124Last checked downSciRes_000140
PindelResource, software resourceSoftware to detect breakpoints of large deletions, medium sized insertions, inversions, tandem duplications and other structural variants at single-based resolution from next-gen sequence data. It uses a pattern growth approach to identify the breakpoints of these variants from paired-end short reads.deletion, insertion, nucleotide, genome, read, inversion, tandem duplication, structural variant, next-generation sequencing, pattern growth, indel, breakpointSCR_000560(Pindel, RRID:SCR_000560)Washington University School of Medicine in St. Louis; Missouri; USA listed by: OMICtools, works_with: cgpPindelPMID:19561018Last checked downOMICS_00321
Functional BiosciencesResource, service resourceA service that provides low cost DNA sequencing. They utilize microfluidic technology.dna, sequencing, sequence, gene, genome, microfluidic, technologySCR_000943(Functional Biosciences, RRID:SCR_000943)listed by: Science ExchangeLast checked downSciEx_9422
Perlegen/NIEHS National Toxicology: Mouse Genome Resequencing ProjectResource, data set, data or information resourceTHIS RESOURCE IS NO LONGER IN SERVICE, Documented on August 12, 2014. Data, grouped by chromosome, available as flat files for download, of identified DNA polymorphisms (SNPs) in 15 commonly used strains of inbred laboratory mice. Perlegen's SNP, genotype (empirical and imputed), haplotype, trace, and PCR primer data has been compiled with NCBI Mouse Build information to produce data files for public use. Using high-density oligonuclueotide array technology, the study identified over 8 million SNPs and other genetic differences between these strains and the previously sequenced C57BL/6J reference strains (Phase 1). By leveraging data provided by Mark Daly's research team at the Broad Institute, genotypes were also predicted for 40 other common strains (Phase 2). Under an extension to the contract, Eleazar Eskin's group at UCLA has used this data to evaluate SNP associations with phenotypes from the Mouse Phenome Project (the Mouse Phenome Database), and to construct haplotype maps for a total of 94 inbred strains (the Mouse HapMap Project). SNP and genotype positions have been mapped from their original reference coordinates to NCBI Mouse Build 37 coordinates. Note that C57BL6/J strain was not selected for re-sequencing as this data would have been almost entirely redundant with the NCBI reference sequence. Since we did not actually determine genotypes for C57BL6/J, we did not submit genotypes for this strain to dbSNP. However, implicit genotypes for C57BL6/J can be obtained from the reference sequence at each SNP position (the reference allele is the first allele in the ALLELES column). The data is available for download in two different compressed file formats. The files are saved as both PC .zip files and Unix compressed .gz files. At this website, you can: * Learn more about the goals of the Perlegen mouse resequencing project. * Learn more about the array-based resequencing technology used in the project. * Download the SNPs, genotypes, and other data generated by the project, plus sequences of the long-range PCR primers used for SNP discovery. * Browse the mouse genome for SNPs. * View the haplotype blocks within the mouse genome. Mouse Genome Browser The Mouse Genome Browser can be used to visualize genes and the SNPs discovered in this study of genome-wide DNA variation in 15 commonly used, genetically diverse strains of inbred laboratory mice. The reference genome is the C57BL/6J strain NCBI build 37 mouse sequence. In addition to the experimentally-derived genotypes for the original 15 strains, the imputed genotypes for 40 additional inbred mouse strains can also be accessed. Mouse Haplotype Analysis The sequences of 16 commonly used, genetically diverse strains of inbred laboratory mice were analyzed to determine their haplotype structure. The Ancestry Browser shows which ancestral sequence each inbred strain most resembles, along with statistics on the pairwise similarity between the ancestral strains. The Haplotype Viewer shows the haplotype block boundaries and the pairwise similarity for all 56 strains: the 15 used for SNP discovery, the reference strain (C57BL/6J), and the 40 additional strains for which the genotypes were imputed.genetic variation, chromosome, dna, genome, genotype, haplotype, oligonuclueotide, inbred mouse strain, polymorphism, sequence, single-nucleotide polymorphism, c57bl6/jSCR_000726(Perlegen/NIEHS National Toxicology: Mouse Genome Resequencing Project, RRID:SCR_000726)HHSN29120045530C (N01-ES-45530), NIEHSrelated to: Mouse HapMap Imputation Genotype ResourceLast checked downnif-0000-21746
GenoViewerResource, software resourceOpen source viewer / browser software for the SAM / BAM format commonly used in the assembly tasks of Next Generation Sequencing sequencing, sequence, mutation, windows, linux, mac os x, genome, browser, sam, bam, fasta, gff, read error, snp, mnp, insertion, deletionSCR_001203(GenoViewer, RRID:SCR_001203)listed by: OMICtoolsLast checked downOMICS_02146
PlantagoraResource, software resource, data set, data or information resourceA web-based plant genome assembly simulation platform whose resources include out of the box scripts for analyzing assembly data, an on-demand web graphing tool to model your experiment, and a downloadable database with metrics and parameters from over 3,000 simulated genome sequencing, genome assembly, sequencing, genome, 454, illumina, simulateSCR_001227(Plantagora, RRID:SCR_001227)University of Arizona; Arizona; USA listed by: OMICtoolsPMID:22174807Last checked downOMICS_02116
Genome TraxResource, service resource, commercial organizationService that provides a comprehensive compilation of variant knowledge that allows you to identify pathogenic variants in human whole genome or exome sequences. It makes it easy to upload a complete genome?s worth of variations and identify the biologically relevant subset of known mutations, mutations that are novel and appear in a candidate disease genes, or mutations that are predicted to have a deleterious effect. The database includes a comprehensive collection of disease causing mutations from HGMD Professional, regulatory sites from TRANSFAC , and disease genes, drug targets and pathways from PROTEOME, as well as pharmacogenomic variants. It integrates the best public data-sets on somatic mutations, allele frequencies and clinical variants, in their most up-to-date version, for a total of more than 165 million annotations. It is possible to identify known pathogenic variants, remove harmless common variants, and obtain deleterious predictions for novel variants. With family data, it is possible to identify variants that are de novo, compound heterozygous only in the offspring. All of the results can be downloaded to Excel for further review. For core facilities and bioinformaticians, the complete underlying data is made available for download and easy integration into custom analysis pipelines. Genome Trax data is optimized to work with many other software packages, such as ANNOVARTM, CLC bio, Alamut, SimulConsult, and sequencing, genome, exome, sequence, variation, mutation, pathogenic, databaseSCR_001234(Genome Trax, RRID:SCR_001234)BIOBASE Corporation listed by: OMICtoolsLast checked downOMICS_02109
Neurospora crassa DatabaseResource, data or information resource, databaseIt's strategy involves Whole Genome Shotgun (WGS) sequencing, in which sequence from the entire genome is generated and reassembled. This method is standard for microbial genome sequencing, and has been successfully applied to Drosophila. Neurospora is an ideal candidate for this approach because of the low repeat content of the genome. Neurospora crassa Database has expanded the scope of its database by including a mitochondrial annotation, incorporating information from the Neurospora compendium, and assigning NCU numbers to tRNA and rRNAs. They have improved the annotation process to predict untranslated regions and to reduce the number of spurious predictions. As a result, version 3 contains 9,826 genes, 794 fewer than version 2. During the initial phase of a WGS project they sequence both ends of the 4 kb inserts from a plasmid library prepared using randomly sheared and sized-selected DNA. The shotgun reads are assembled by recognizing overlapping regions of sequence and making use of the knowledge of the orientation and distance of the paired reads from each plasmid. Obtaining deep sequence coverage though high levels of sequence redundancy assures that the majority of the genome is represented in the initial assembly and that the consensus sequence is of high quality. Their approach toward the initial assembly was conservative, meaning they would rather fail to join sequence contigs that might overlap each other than risk making false joins between two closely related but non-overlapping genomic regions. Hence, the initial assembly contains many sequence contigs and over time these contigs will increase in size and decrease in number as they are joined together. After shotgun sequencing and assembly there was a second phase of sequencing in which additional sequence was obtained from specific regions that were missing from the original assembly or are recognized to be of low quality in the consensus. The Neurospora crassa sequencing project reflects a close collaboration between the Broad Institute and the Neurospora research community. Principal investigators include Bruce Birren and Chad Nusbaum from the Broad Institute, Matt Sachs at the Oregon Graduate Institute of Science and Technology, Chuck Staben at the University of Kentucky and Jak Kinsey at the Fungal Genetics Stock Center at the University of Kansas Medical Center. In addition, we have a larger Advisory Board made up of a number of Neurospora researchers. Sponsors: They have been funded by the National Science Foundation to sequence the N. crassa genome and make the information publicly available.gene, annotation, compendium, contig, distance, drosophila, genome, mitochondrial, neurospora crassa, plasmid, region, rrna, sequence, trna, untranslatedSCR_001372(Neurospora crassa Database, RRID:SCR_001372)Last checked downnif-0000-20965
  1. RRID Portal Resources

    Welcome to the RRID Resources search. From here you can search through a compilation of resources used by RRID and see how data is organized within our community.

  2. Navigation

    You are currently on the Community Resources tab looking through categories and sources that RRID has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.

  3. Logging in and Registering

    If you have an account on RRID then you can log in from here to get additional features in RRID such as Collections, Saved Searches, and managing Resources.

  4. Searching

    Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:

    1. Use quotes around phrases you want to match exactly
    2. You can manually AND and OR terms to change how we search between words
    3. You can add "-" to terms to make sure no results return with that term in them (ex. Cerebellum -CA1)
    4. You can add "+" to terms to require they be in the data
    5. Using autocomplete specifies which branch of our semantics you with to search and can help refine your search
  5. Collections

    If you are logged into RRID you can add data records to your collections to create custom spreadsheets across multiple sources of data.

  6. Facets

    Here are the facets that you can filter the data by.

  7. Further Questions

    If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.