Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

This service exclusively searches for literature that cites resources. Please be aware that the total number of searchable documents is limited to those containing RRIDs and does not include all open-access literature.

Search

Type in a keyword to search

On page 1 showing 1 ~ 20 papers out of 186 papers

Map of physical interactions between extracellular domains of Arabidopsis leucine-rich repeat receptor kinases.

  • G Adam Mott‎ et al.
  • Scientific data‎
  • 2019‎

Plants use surface receptors to perceive information about many aspects of their local environment. These receptors physically interact to form both steady state and signalling competent complexes. The signalling events downstream of receptor activation impact both plant developmental and immune responses. Here, we present a comprehensive study of the physical interactions between the extracellular domains of leucine-rich repeat receptor kinases (LRR-RKs) in Arabidopsis. Using a sensitized assay, we tested reciprocal interactions among 200 of the 225 Arabidopsis LRR-RKs for a total search space of 40,000 interactions. Applying a stringent statistical cut-off and requiring that interactions performed well in both bait-prey and prey-bait orientations resulted in a high-confidence set of 567 bidirectional interactions. Additionally, we identified a total of 2,586 unidirectional interactions, which passed our stringent statistical cut-off in only one orientation. These datasets will guide further investigation into the regulatory roles of LRR-RKs in plant developmental and immune signalling decisions.


Chromosome-level genome of spider Pardosa pseudoannulata and cuticle protein genes in environmental stresses.

  • Na Yu‎ et al.
  • Scientific data‎
  • 2024‎

Spiders are representative arthropods of adaptive radiation. The high-quality genomes have only been reported in several web weaver spider species, leaving the wandering spiders' genomic information scarce. The pond wolf spider, Pardosa pseudoannulata, is a representative species in the retrolateral titial apophysis (RTA) clade. We present a chromosome-level P. pseusoannulata genome assembly of 2.42 Gb in size with a scaffold N50 of 169.99 Mb. Hi-C scaffolding assigns 94.83% of the bases to 15 pseudo-chromosomes. The repeats account for 52.79% of the assembly. The assembly includes 96.2% of the complete arthropod universal single-copy orthologs. Gene annotation predicted 24,530 protein-coding genes with a BUSCO score of 95.8% complete. We identified duplicate clusters of Hox genes and an expanded cuticle protein gene family with 243 genes. The expression patterns of CPR genes change in response to environmental stresses such as coldness and insecticide exposure. The high-quality P. pseudoannulata genome provides valuable information for functional and comparative studies in spiders.


Unified access to up-to-date residue-level annotations from UniProtKB and other biological databases for PDB data.

  • Preeti Choudhary‎ et al.
  • Scientific data‎
  • 2023‎

More than 61,000 proteins have up-to-date correspondence between their amino acid sequence (UniProtKB) and their 3D structures (PDB), enabled by the Structure Integration with Function, Taxonomy and Sequences (SIFTS) resource. SIFTS incorporates residue-level annotations from many other biological resources. SIFTS data is available in various formats like XML, CSV and TSV format or also accessible via the PDBe REST API but always maintained separately from the structure data (PDBx/mmCIF file) in the PDB archive. Here, we extended the wwPDB PDBx/mmCIF data dictionary with additional categories to accommodate SIFTS data and added the UniProtKB, Pfam, SCOP2, and CATH residue-level annotations directly into the PDBx/mmCIF files from the PDB archive. With the integrated UniProtKB annotations, these files now provide consistent numbering of residues in different PDB entries allowing easy comparison of structure models. The extended dictionary yields a more consistent, standardised metadata description without altering the core PDB information. This development enables up-to-date cross-reference information at the residue level resulting in better data interoperability, supporting improved data analysis and visualisation.


ACO2 clinicobiological dataset with extensive phenotype ontology annotation.

  • Khadidja Guehlouz‎ et al.
  • Scientific data‎
  • 2021‎

Pathogenic variants of the aconitase 2 gene (ACO2) are responsible for a broad clinical spectrum involving optic nerve degeneration, ranging from isolated optic neuropathy with recessive or dominant inheritance, to complex neurodegenerative syndromes with recessive transmission. We created the first public locus-specific database (LSDB) dedicated to ACO2 within the "Global Variome shared LOVD" using exclusively the Human Phenotype Ontology (HPO), a standard vocabulary for describing phenotypic abnormalities. All the variants and clinical cases listed in the literature were incorporated into the database, from which we produced a dataset. We followed a rational and comprehensive approach based on the HPO thesaurus, demonstrating that ACO2 patients should not be classified separately between isolated and syndromic cases. Our data highlight that certain syndromic patients do not have optic neuropathy and provide support for the classification of the recurrent pathogenic variants c.220C>G and c.336C>G as likely pathogenic. Overall, our data records demonstrate that the clinical spectrum of ACO2 should be considered as a continuum of symptoms and refines the classification of some common variants.


Hybrid genome assembly and annotation of Danionella translucida.

  • Mykola Kadobianskyi‎ et al.
  • Scientific data‎
  • 2019‎

Studying neuronal circuits at cellular resolution is very challenging in vertebrates due to the size and optical turbidity of their brains. Danionella translucida, a close relative of zebrafish, was recently introduced as a model organism for investigating neural network interactions in adult individuals. Danionella remains transparent throughout its life, has the smallest known vertebrate brain and possesses a rich repertoire of complex behaviours. Here we sequenced, assembled and annotated the Danionella translucida genome employing a hybrid Illumina/Nanopore read library as well as RNA-seq of embryonic, larval and adult mRNA. We achieved high assembly continuity using low-coverage long-read data and annotated a large fraction of the transcriptome. This dataset will pave the way for molecular research and targeted genetic manipulation of this novel model organism.


Domain-centric database to uncover structure of minimally characterized viral genomes.

  • John C Bramley‎ et al.
  • Scientific data‎
  • 2020‎

Protein domain-based approaches to analyzing sequence data are valuable tools for examining and exploring genomic architecture across genomes of different organisms. Here, we present a complete dataset of domains from the publicly available sequence data of 9,051 reference viral genomes. The data provided contain information such as sequence position and neighboring domains from 30,947 pHMM-identified domains from each reference viral genome. Domains were identified from viral whole-genome sequence using automated profile Hidden Markov Models (pHMM). This study also describes the framework for constructing "domain neighborhoods", as well as the dataset representing it. These data can be used to examine shared and differing domain architectures across viral genomes, to elucidate potential functional properties of genes, and potentially to classify viruses.


309 metagenome assembled microbial genomes from deep sediment samples in the Gulfs of Kathiawar Peninsula.

  • Neelam M Nathani‎ et al.
  • Scientific data‎
  • 2021‎

Prokaryoplankton genomes from the deep marine sediments are less explored compared to shallow shore sediments. The Gulfs of Kathiawar peninsula experience varied currents and inputs from different on-shore activities. Any perturbations would directly influence the microbiome and their normal homeostasis. Advancements in reconstructing genomes from metagenomes allows us to understand the role of individual unculturable microbes in ecological niches like the Gulf sediments. Here, we report 309 bacterial and archaeal genomes assembled from metagenomics data of deep sediments from sites in the Gulf of Khambhat and Gulf of Kutch as well as a sample from the Arabian Sea. Phylogenomics classified them into 5 archaeal and 18 bacterial phyla. The genomes will facilitate understanding of the physiology, adaptation and impact of on-shore anthropogenic activities on the deep sediment microbes.


Chromosomal-scale genome assembly of the near-extinction big-head schizothorcin (Aspiorhynchus laticeps).

  • Jiangong Niu‎ et al.
  • Scientific data‎
  • 2022‎

The big-head schizothorcin (Aspiorhynchus laticeps) is an endemic and near-extinction freshwater fish in Xinjiang, China. In this study, a chromosome-scale genome assembly of A. laticeps was generated using PacBio and Hi-C techniques. The PacBio sequencing data resulted in a 1.58 Gb assembly with a contig N50 of 1.27 Mb. Using Hi-C scaffolding approach, 88.38% of the initial assembled sequences were anchored and oriented into a chromosomal-scale assembly. The final assembly consisted of 25 pseudo-chromosomes that yielded 1.37 Gb of sequence, with a scaffold N50 of 44.02 Mb. BUSCO analysis showed a completeness score of 93.7%. The genome contained 48,537 predicted protein-coding genes and 58.31% of the assembly was annotated as repetitive sequences. Whole genome duplication events were further confirmed using 4dTv analysis. The genome assembly of A. laticeps should be valuable and important to understand the genetic adaptation and endangerment process of this species, which could lead to more effective management and conservation of the big-head schizothorcin and related freshwater fish species.


Draft genome of the big-headed turtle Platysternon megacephalum.

  • Dainan Cao‎ et al.
  • Scientific data‎
  • 2019‎

The big-headed turtle, Platysternon megacephalum, as the sole member of the monotypic family Platysternidae, has a number of distinct characteristics including an extra-large head, long tail, flat carapace, and a preference for low water temperature environments. We performed whole genome sequencing, assembly, and gene annotation of an adult male big-headed turtle based on the Illumina HiSeq X genomic sequencing platform. We generated ~497.1 Gb of raw sequencing data (×208.9 depth) and produced a draft genome with a total length of 2.32 Gb and contig and scaffold N50 sizes of 41.8 kb and 7.22 Mb, respectively. We also identified 924 Mb (39.84%) of repetitive sequences, 25,995 protein-coding genes, and 19,177 non-coding RNAs. We generated the first de novo genome of the big-headed turtle; these data will be essential to the further understanding and exploration of the genomic innovations and molecular mechanisms contributing to its unique morphology and physiological features.


Chromosomal level genome assemblies of two Malus crabapple cultivars Flame and Royalty.

  • Hua Li‎ et al.
  • Scientific data‎
  • 2024‎

Malus hybrid 'Flame' and Malus hybrid 'Royalty' are representative ornamental crabapples, rich in flavonoids and serving as the preferred materials for studying the coloration mechanism. We generated two sets of high-quality chromosome-level and haplotype-resolved genome of 'Flame' with sizes of 688.2 Mb and 675.7 Mb, and those of 'Royalty' with sizes of 674.1 Mb and 663.6 Mb, all anchored to 17 chromosomes and with a high BUSCO completeness score nearly 99.0%. A total of 47,833 and 47,307 protein-coding genes were annotated in the two haplotype genomes of 'Flame', and the numbers of 'Royalty' were 46,305 and 46,920 individually. The assembled high-quality genomes offer new resources for studying the origin and adaptive evolution of crabapples and the molecular basis of the accumulation of flavonoids and anthocyanins, facilitating molecular breeding of Malus plants.


Haplotype-resolved chromosomal-level genome assembly of Buzhaye (Microcos paniculata).

  • Detuan Liu‎ et al.
  • Scientific data‎
  • 2023‎

Microcos paniculata is a shrub used traditionally as folk medicine and to make herbal teas. Previous research into this species has mainly focused on its chemical composition and medicinal value. However, the lack of a reference genome limits the study of the molecular mechanisms of active compounds in this species. Here, we assembled a haplotype-resolved chromosome-level genome of M. paniculata based on PacBio HiFi and Hi-C data. The assembly contains two haploid genomes with sizes 399.43 Mb and 393.10 Mb, with contig N50 lengths of 43.44 Mb and 30.17 Mb, respectively. About 99.93% of the assembled sequences could be anchored to 18 pseudo-chromosomes. Additionally, a total of 482 Mb repeat sequences were identified, accounting for 60.76% of the genome. A total of 49,439 protein-coding genes were identified, of which 48,979 (99%) were functionally annotated. This haplotype-resolved chromosome-level assembly and annotation of M. paniculata will serve as a valuable resource for investigating the biosynthesis and genetic basis of active compounds in this species, as well as advancing evolutionary phylogenomic studies in Malvales.


A haplotype-resolved genome assembly of Rhododendron vialii based on PacBio HiFi reads and Hi-C data.

  • Yuhang Chang‎ et al.
  • Scientific data‎
  • 2023‎

Rhododendron vialii (subgen. Azaleastrum) is an evergreen shrub with high ornamental value. This species has been listed as a plant species with extremely small populations (PSESP) for urgent protection by China's Yunnan provincial government in 2021, due to anthropogenic habitat fragmentation. However, limited genomic resources hinder scientifically understanding of genetic threats that the species is currently facing. In this study, we assembled a high-quality haplotype-resolved genome of R. vialii based on PacBio HiFi long reads and Hi-C reads. The assembly contains two haploid genomes with sizes 532.73 Mb and 521.98 Mb, with contig N50 length of 35.67 Mb and 34.70 Mb, respectively. About 99.92% of the assembled sequences could be anchored to 26 pseudochromosomes, and 14 gapless assembled chromosomes were included in this assembly. Additionally, 60,926 protein-coding genes were identified, of which 93.82% were functionally annotated. This is the first reported genome of R. vialii, and hopefully it will lay the foundations for further research into the conservation genomics and horticultural domestication of this ornamentally important species.


The sequence and de novo assembly of Oxygymnocypris stewartii genome.

  • Hai-Ping Liu‎ et al.
  • Scientific data‎
  • 2019‎

Animal genomes in the Qinghai-Tibetan Plateau provide valuable resources for scientists to understand the molecular mechanism of environmental adaptation. Tibetan fish species play essential roles in the local ecology; however, the genomic information for native fishes was still insufficient. Oxygymnocypris stewartii, belonging to Oxygymnocypris genus, Schizothoracinae subfamily, is a native fish in the Tibetan plateau living within the elevation from roughly 3,000 m to 4,200 m. In this report, PacBio and Illumina sequencing platform were used to generate ~385.3 Gb genomic sequencing data. A genome of about 1,849.2 Mb was obtained with a contig N50 length of 257.1 kb. More than 44.5% of the genome were identified as repetitive elements, and 46,400 protein-coding genes were annotated in the genome. The assembled genome can be used as a reference for future population genetic studies of O. stewartii and will improve our understanding of high altitude adaptation of fishes in the Qinghai-Tibetan Plateau.


De novo transcriptome assembly and annotation for gene discovery in Salamandra salamandra at the larval stage.

  • Pietro Libro‎ et al.
  • Scientific data‎
  • 2023‎

Dispersal is a key process in ecology and evolutionary biology, as it shapes biodiversity patterns over space and time. Attitude to disperse is unevenly distributed among individuals within populations, and that individual personality can have pivotal roles in the shaping of this attitude. Here, we assembled and annotated the first de novo transcriptome of the head tissues of Salamandra salamandra from individuals, representative of distinct behavioral profiles. We obtained 1,153,432,918 reads, which were successfully assembled and annotated. The high-quality of the assembly was confirmed by three assembly validators. The alignment of contigs against the de novo transcriptome led to a mapping percentage higher than 94%. The homology annotation with DIAMOND led to 153,048 (blastx) and 95,942 (blastp) shared contigs, annotated on NR, Swiss-Prot and TrEMBL. The domain and site protein prediction led to 9850 GO-annotated contigs. This de novo transcriptome represents reliable reference for comparative gene expression studies between alternative behavioral types, for comparative gene expression studies within Salamandra, and for whole transcriptome and proteome studies in amphibians.


A chromosome-level genome assembly of the Asian arowana, Scleropages formosus.

  • Jia Li‎ et al.
  • Scientific data‎
  • 2016‎

Asian arowana (Scleropages formosus), an ancient teleost belonging to the Order Osteoglossomorpha, has been a valuable ornamental fish with some varieties. However, its biological studies and breeding germplasm have been remarkably limited by the lack of a reference genome. To solve these problems, here we report high-quality genome sequences of three common varieties of Asian arowana (the golden, red and green arowana). We firstly generated a chromosome-level genome assembly of the golden arowana, on basis of the genetic linkage map constructed with the restriction site-associated DNA sequencing (RAD-seq). In addition, we obtained draft genome assemblies of the red and green varieties. Finally, we annotated 22,016, 21,256 and 21,524 protein-coding genes in the genome assemblies of golden, red and green varieties respectively. Our data were deposited in publicly accessible repositories to promote biological research and molecular breeding of Asian arowana.


Draft genomes of female and male turbot Scophthalmus maximus.

  • Xi-Wen Xu‎ et al.
  • Scientific data‎
  • 2020‎

Turbot (Scophthalmus maximus) is a commercially important flatfish species in aquaculture. It has a drastic sexual dimorphism, with females growing faster than males. In the present study, we sequenced and de novo assembled female and male turbot genomes. The assembled female genome was 568 Mb (scaffold N50, 6.2 Mb, BUSCO 97.4%), and the male genome was 584 Mb (scaffold N50, 5.9 Mb, BUSCO 96.6%). Using two genetic maps, we anchored female scaffolds representing 535 Mb onto 22 chromosomes. Annotation of the female anchored genome identified 87.8 Mb transposon elements and 20,134 genes. We identified 17,936 gene families, of which 369 gene families were flatfish specific. Phylogenetic analysis showed that the turbot, Japanese flounder and Chinese tongue sole form a clade that diverged from other teleosts approximately 78 Mya. This report of female and male turbot draft genomes and annotated genes provides a new resource for identifying sex determination genes, elucidating the evolution of adaptive traits in flatfish and developing genetic techniques to increase the sustainability of turbot aquaculture.


Improved chromosomal-level genome assembly and re-annotation of leopard coral grouper.

  • Wentao Han‎ et al.
  • Scientific data‎
  • 2023‎

Plectropomus leopardus, as known as leopard coral grouper, is a valuable marine fish that has gradually been bred artificially. To promote future conservation, molecular breeding, and comparative studies, we generated an improved high-quality chromosomal-level genome assembly of leopard coral grouper using Nanopore long-reads, Illumina short reads, and the Hi-C sequencing data. The draft genome is 849.74 Mb with 45 contigs and N50 of 35.59 Mb. Finally, a total of 846.49 Mb corresponding to 99.6% of the contig sequences was anchored to 24 pseudo-chromosomes using Hi-C technology. A final set of 25,965 genes is annotated after manual curation of the predicted gene models, and BUSCO analysis yielded a completeness score of 99.5%. This study significantly improves the utility of the grouper genome and provided a reference for the study of molecular breeding, genomics and biology in this species.


De novo transcriptome assembly and gene annotation for the toxic dinoflagellate Dinophysis.

  • Chetan C Gaonkar‎ et al.
  • Scientific data‎
  • 2023‎

Species within the dinoflagellate genus Dinophysis can produce okadiac acid and dinophysistoxins leading to diarrhetic shellfish poisoning. Since the first report of D. ovum from the Gulf of Mexico in 2008, reports of other Dinophysis species across US have increased. Members of the D. cf. acuminata complex (D. acuminata, D. acuta, D. ovum, D. sacculus) are difficult to differentiate due to their morphological similarities. Dinophysis feeds on and steals the chloroplasts from the ciliate, Mesodinium rubrum, which in turn has fed on and captured the chloroplasts of its prey, the cryptophyte Teleaulax amphioxeia. The objective of this study was to generate de novo transcriptomes for new isolates of these mixotrophic organisms. The transcriptomes obtained will serve as a reference for future experiments to assess the effect of different abiotic and biotic conditions and will also provide a useful resource for screening potential marker genes to differentiate among the closely related species within the D. cf. acuminata-complex. The complete comprehensive detailed workflow and links to obtain the transcriptome data are provided.


Transcriptome sequence resource for the cucurbit powdery mildew pathogen Podosphaera xanthii.

  • Rita Milvia De Miccolis Angelini‎ et al.
  • Scientific data‎
  • 2019‎

Podosphaera xanthii is the main causal agent of cucurbit powdery mildew in Southern Italy. Illumina sequencing of mRNA from two P. xanthii isolates of opposite mating types (MAT1-1 and MAT1-2) and their sexual cross was used to obtain a detailed de novo Trinity-based assembly of the transcriptome of the fungus. Over 60 million of high-quality paired-end reads were obtained and assembled into 71,095 contigs corresponding to putative transcripts that were functionally annotated. More than 55% of the assembled transcripts (40,221 contigs) had a significant hit in BLASTx search and included sequences related to sexual compatibility and reproduction, as well as several classes of transposable elements and putative mycoviruses. The availability of these new transcriptomic data and investigations on potential source of genetic variation in P. xanthii will promote new insights on the pathogen and its interactions with host plants and associated microbiome.


A longitudinal dataset of five years of public activity in the Scratch online community.

  • Benjamin Mako Hill‎ et al.
  • Scientific data‎
  • 2017‎

Scratch is a programming environment and an online community where young people can create, share, learn, and communicate. In collaboration with the Scratch Team at MIT, we created a longitudinal dataset of public activity in the Scratch online community during its first five years (2007-2012). The dataset comprises 32 tables with information on more than 1 million Scratch users, nearly 2 million Scratch projects, more than 10 million comments, more than 30 million visits to Scratch projects, and more. To help researchers understand this dataset, and to establish the validity of the data, we also include the source code of every version of the software that operated the website, as well as the software used to generate this dataset. We believe this is the largest and most comprehensive downloadable dataset of youth programming artifacts and communication.


  1. SciCrunch.org Resources

    Welcome to the FDI Lab - SciCrunch.org Resources search. From here you can search through a compilation of resources used by FDI Lab - SciCrunch.org and see how data is organized within our community.

  2. Navigation

    You are currently on the Community Resources tab looking through categories and sources that FDI Lab - SciCrunch.org has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.

  3. Logging in and Registering

    If you have an account on FDI Lab - SciCrunch.org then you can log in from here to get additional features in FDI Lab - SciCrunch.org such as Collections, Saved Searches, and managing Resources.

  4. Searching

    Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:

    1. Use quotes around phrases you want to match exactly
    2. You can manually AND and OR terms to change how we search between words
    3. You can add "-" to terms to make sure no results return with that term in them (ex. Cerebellum -CA1)
    4. You can add "+" to terms to require they be in the data
    5. Using autocomplete specifies which branch of our semantics you with to search and can help refine your search
  5. Save Your Search

    You can save any searches you perform for quick access to later from here.

  6. Query Expansion

    We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.

  7. Collections

    If you are logged into FDI Lab - SciCrunch.org you can add data records to your collections to create custom spreadsheets across multiple sources of data.

  8. Facets

    Here are the facets that you can filter your papers by.

  9. Options

    From here we'll present any options for the literature, such as exporting your current results.

  10. Further Questions

    If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.

Publications Per Year

X

Year:

Count: