Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

This service exclusively searches for literature that cites resources. Please be aware that the total number of searchable documents is limited to those containing RRIDs and does not include all open-access literature.

Search

Type in a keyword to search

On page 1 showing 1 ~ 20 papers out of 51 papers

Short reads phasing to construct haplotypes in genomic regions that are associated with body mass index in korean individuals.

  • Kichan Lee‎ et al.
  • Genomics & informatics‎
  • 2014‎

Genome-wide association (GWA) studies have found many important genetic variants that affect various traits. Since these studies are useful to investigate untyped but causal variants using linkage disequilibrium (LD), it would be useful to explore the haplotypes of single-nucleotide polymorphisms (SNPs) within the same LD block of significant associations based on high-density variants from population references. Here, we tried to make a haplotype catalog affecting body mass index (BMI) through an integrative analysis of previously published whole-genome next-generation sequencing (NGS) data of 7 representative Korean individuals and previously known Korean GWA signals. We selected 435 SNPs that were significantly associated with BMI from the GWA analysis and searched 53 LD ranges nearby those SNPs. With the NGS data, the haplotypes were phased within the LDs. A total of 44 possible haplotype blocks for Korean BMI were cataloged. Although the current result constitutes little data, this study provides new insights that may help to identify important haplotypes for traits and low variants nearby significant SNPs. Furthermore, we can build a more comprehensive catalog as a larger dataset becomes available.


Whole genome comparison of donor and cloned dogs.

  • Hak-Min Kim‎ et al.
  • Scientific reports‎
  • 2013‎

Cloning is a process that produces genetically identical organisms. However, the genomic degree of genetic resemblance in clones needs to be determined. In this report, the genomes of a cloned dog and its donor were compared. Compared with a human monozygotic twin, the genome of the cloned dog showed little difference from the genome of the nuclear donor dog in terms of single nucleotide variations, chromosomal instability, and telomere lengths. These findings suggest that cloning by somatic cell nuclear transfer produced an almost identical genome. The whole genome sequence data of donor and cloned dogs can provide a resource for further investigations on epigenetic contributions in phenotypic differences.


Compiling Multicopy Single-Stranded DNA Sequences from Bacterial Genome Sequences.

  • Wonseok Yoo‎ et al.
  • Genomics & informatics‎
  • 2016‎

A retron is a bacterial retroelement that encodes an RNA gene and a reverse transcriptase (RT). The former, once transcribed, works as a template primer for reverse transcription by the latter. The resulting DNA is covalently linked to the upstream part of the RNA; this chimera is called multicopy single-stranded DNA (msDNA), which is extrachromosomal DNA found in many bacterial species. Based on the conserved features in the eight known msDNA sequences, we developed a detection method and applied it to scan National Center for Biotechnology Information (NCBI) RefSeq bacterial genome sequences. Among 16,844 bacterial sequences possessing a retron-type RT domain, we identified 48 unique types of msDNA. Currently, the biological role of msDNA is not well understood. Our work will be a useful tool in studying the distribution, evolution, and physiological role of msDNA.


Analysis of Nuclear Mitochondrial DNA Segments of Nine Plant Species: Size, Distribution, and Insertion Loci.

  • Young-Joon Ko‎ et al.
  • Genomics & informatics‎
  • 2016‎

Nuclear mitochondrial DNA segment (Numt) insertion describes a well-known phenomenon of mitochondrial DNA transfer into a eukaryotic nuclear genome. However, it has not been well understood, especially in plants. Numt insertion patterns vary from species to species in different kingdoms. In this study, the patterns were surveyed in nine plant species, and we found some tip-offs. First, when the mitochondrial genome size is relatively large, the portion of the longer Numt is also larger than the short one. Second, the whole genome duplication event increases the ratio of the shorter Numt portion in the size distribution. Third, Numt insertions are enriched in exon regions. This analysis may be helpful for understanding plant evolution.


ChARM: Discovery of combinatorial chromatin modification patterns in hepatitis B virus X-transformed mouse liver cancer using association rule mining.

  • Sung Hee Park‎ et al.
  • BMC bioinformatics‎
  • 2016‎

Various chromatin modifications, identified in large-scale epigenomic analyses, are associated with distinct phenotypes of different cells and disease phases. To improve our understanding of these variations, many computational methods have been developed to discover novel sites and cell-specific chromatin modifications. Despite the availability of existing methods, there is still room for further improvement when they are applied to resolve the histone code hypothesis. Hence, we aim to investigate the development of a computational method to provide new insights into de novo combinatorial pattern discovery of chromatin modifications to characterize epigenetic variations in distinct phenotypes of different cells.


CysQ of Cryptosporidium parvum, a Protozoa, May Have Been Acquired from Bacteria by Horizontal Gene Transfer.

  • Ji Young Lee‎ et al.
  • Genomics & informatics‎
  • 2012‎

Horizontal gene transfer (HGT) is the movement of genetic material between kingdoms and is considered to play a positive role in adaptation. Cryptosporidium parvum is a parasitic protozoan that causes an infectious disease. Its genome sequencing reported 14 bacteria-like proteins in the nuclear genome. Among them, cgd2_1810, which has been annotated as CysQ, a sulfite synthesis pathway protein, is listed as one of the candidates of genes horizontally transferred from bacterial origin. In this report, we examined this issue using phylogenetic analysis. Our BLAST search showed that C. parvum CysQ protein had the highest similarity with that of proteobacteria. Analysis with NCBI's Conserved Domain Tree showed phylogenetic incongruence, in that C. parvum CysQ protein was located within a branch of proteobacteria in the cd01638 domain, a bacterial member of the inositol monophosphatase family. According to Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway, the sulfate assimilation pathway, where CysQ plays an important role, is well conserved in most eukaryotes as well as prokaryotes. However, the Apicomplexa, including C. parvum, largely lack orthologous genes of the pathway, suggesting its loss in those protozoan lineages. Therefore, we conclude that C. parvum regained cysQ from proteobacteria by HGT, although its functional role is elusive.


FESD: a Functional Element SNPs Database in human.

  • Hyo Jin Kang‎ et al.
  • Nucleic acids research‎
  • 2005‎

We have created the Functional Element SNPs Database (FESD) that categorizes functional elements in human genic regions and provides a set of single nucleotide polymorphisms (SNPs) located within each area. In the FESD, the human genic regions were divided into 10 different functional elements, such as promoter regions, CpG islands, 5'-untranslated regions (5'-UTRs), translation start sites, splice sites, coding exons, introns, translation stop sites, polyadenylation signals and 3'-UTRs, and subsequently, all the known SNPs were assigned to each functional element at their respective position. With the FESD web interface, users can select a set of SNPs in the specific functional elements and get their flanking sequences for genotyping experiments, which will help in finding mutations that contribute to the common and polygenic diseases. A web interface for the FESD is freely available at http://combio.kribb.re.kr/ksnp/resd/.


Localizome: a server for identifying transmembrane topologies and TM helices of eukaryotic proteins utilizing domain information.

  • Sunghoon Lee‎ et al.
  • Nucleic acids research‎
  • 2006‎

The Localizome server predicts the transmembrane (TM) helix number and TM topology of a user-supplied eukaryotic protein and presents the result as an intuitive graphic representation. It utilizes hmmpfam to detect the presence of Pfam domains and a prediction algorithm, Phobius, to predict the TM helices. The results are combined and checked against the TM topology rules stored in a protein domain database called LocaloDom. LocaloDom is a curated database that contains TM topologies and TM helix numbers of known protein domains. It was constructed from Pfam domains combined with Swiss-Prot annotations and Phobius predictions. The Localizome server corrects the combined results of the user sequence to conform to the rules stored in LocaloDom. Compared with other programs, this server showed the highest accuracy for TM topology prediction: for soluble proteins, the accuracy and coverage were 99 and 75%, respectively, while for TM protein domain regions, they were 96 and 68%, respectively. With a graphical representation of TM topology and TM helix positions with the domain units, the Localizome server is a highly accurate and comprehensive information source for subcellular localization for soluble proteins as well as membrane proteins. The Localizome server can be found at http://localizome.org/.


Computational Drug Repositioning for Gastric Cancer using Reversal Gene Expression Profiles.

  • In-Wha Kim‎ et al.
  • Scientific reports‎
  • 2019‎

Treatment of gastric cancer (GC) often produces poor outcomes. Moreover, predicting which GC treatments will be effective remains challenging. Computational drug repositioning using public databases is a promising and efficient tool for discovering new uses for existing drugs. Here we used a computational reversal of gene expression approach based on effects on gene expression signatures by GC disease and drugs to explore new GC drug candidates. Gene expression profiles for individual GC tumoral and normal gastric tissue samples were downloaded from the Gene Expression Omnibus (GEO) and differentially expressed genes (DEGs) in GC were determined with a meta-signature analysis. Profiles drug activity and drug-induced gene expression were downloaded from the ChEMBL and the LINCS databases, respectively. Candidate drugs to treat GC were predicted using reversal gene expression score (RGES). Drug candidates including sorafenib, olaparib, elesclomol, tanespimycin, selumetinib, and ponatinib were predicted to be active for treatment of GC. Meanwhile, GC-related genes such as PLOD3, COL4A1, UBE2C, MIF, and PRPF5 were identified as having gene expression profiles that can be reversed by drugs. These findings support the use of a computational reversal gene expression approach to identify new drug candidates that can be used to treat GC.


Performance Comparison of Two Gene Set Analysis Methods for Genome-wide Association Study Results: GSA-SNP vs i-GSEA4GWAS.

  • Ji-Sun Kwon‎ et al.
  • Genomics & informatics‎
  • 2012‎

Gene set analysis (GSA) is useful in interpreting a genome-wide association study (GWAS) result in terms of biological mechanism. We compared the performance of two different GSA implementations that accept GWAS p-values of single nucleotide polymorphisms (SNPs) or gene-by-gene summaries thereof, GSA-SNP and i-GSEA4GWAS, under the same settings of inputs and parameters. GSA runs were made with two sets of p-values from a Korean type 2 diabetes mellitus GWAS study: 259,188 and 1,152,947 SNPs of the original and imputed genotype datasets, respectively. When Gene Ontology terms were used as gene sets, i-GSEA4GWAS produced 283 and 1,070 hits for the unimputed and imputed datasets, respectively. On the other hand, GSA-SNP reported 94 and 38 hits, respectively, for both datasets. Similar, but to a lesser degree, trends were observed with Kyoto Encyclopedia of Genes and Genomes (KEGG) gene sets as well. The huge number of hits by i-GSEA4GWAS for the imputed dataset was probably an artifact due to the scaling step in the algorithm. The decrease in hits by GSA-SNP for the imputed dataset may be due to the fact that it relies on Z-statistics, which is sensitive to variations in the background level of associations. Judicious evaluation of the GSA outcomes, perhaps based on multiple programs, is recommended.


The tiger genome and comparative analysis with lion and snow leopard genomes.

  • Yun Sung Cho‎ et al.
  • Nature communications‎
  • 2013‎

Tigers and their close relatives (Panthera) are some of the world's most endangered species. Here we report the de novo assembly of an Amur tiger whole-genome sequence as well as the genomic sequences of a white Bengal tiger, African lion, white African lion and snow leopard. Through comparative genetic analyses of these genomes, we find genetic signatures that may reflect molecular adaptations consistent with the big cats' hypercarnivorous diet and muscle strength. We report a snow leopard-specific genetic determinant in EGLN1 (Met39>Lys39), which is likely to be associated with adaptation to high altitude. We also detect a TYR260G>A mutation likely responsible for the white lion coat colour. Tiger and cat genomes show similar repeat composition and an appreciably conserved synteny. Genomic data from the five big cats provide an invaluable resource for resolving easily identifiable phenotypes evident in very close, but distinct, species.


Integrative annotation of 21,037 human genes validated by full-length cDNA clones.

  • Tadashi Imanishi‎ et al.
  • PLoS biology‎
  • 2004‎

The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology.


Binary image representation of a ligand binding site: its application to efficient sampling of a conformational ensemble.

  • Edon Sung‎ et al.
  • BMC bioinformatics‎
  • 2010‎

Modelling the ligand binding site of a protein is an important component of understanding protein-ligand interactions and is being actively studied. Even if the side chains are restricted to rotamers, a set of commonly-observed low-energy conformations, the exhaustive combinatorial search of ligand binding site conformers is known as NP-hard. Here we propose a new method, ROTAIMAGE, for modelling the plausible conformers for the ligand binding site given a fixed backbone structure.


Prediction of protein-protein interaction types using association rule based classification.

  • Sung Hee Park‎ et al.
  • BMC bioinformatics‎
  • 2009‎

Protein-protein interactions (PPI) can be classified according to their characteristics into, for example obligate or transient interactions. The identification and characterization of these PPI types may help in the functional annotation of new protein complexes and in the prediction of protein interaction partners by knowledge driven approaches.


Impaired DNA damage response, genome instability, and tumorigenesis in SIRT1 mutant mice.

  • Rui-Hong Wang‎ et al.
  • Cancer cell‎
  • 2008‎

In lower eukaryotes, Sir2 serves as a histone deacetylase and is implicated in chromatin silencing, longevity, and genome stability. Here we mutated the Sirt1 gene, a homolog of yeast Sir2, in mice to study its function. We show that a majority of SIRT1 null embryos die between E9.5 and E14.5, displaying altered histone modification, impaired DNA damage response, and reduced ability to repair DNA damage. We demonstrate that Sirt1(+/-);p53(+/-) mice develop tumors in multiple tissues, whereas activation of SIRT1 by resveratrol treatment reduces tumorigenesis. Finally, we show that many human cancers exhibit reduced levels of SIRT1 compared to normal controls. Thus, SIRT1 may act as a tumor suppressor through its role in DNA damage response and genome integrity.


Comparison of carnivore, omnivore, and herbivore mammalian genomes with a new leopard assembly.

  • Soonok Kim‎ et al.
  • Genome biology‎
  • 2016‎

There are three main dietary groups in mammals: carnivores, omnivores, and herbivores. Currently, there is limited comparative genomics insight into the evolution of dietary specializations in mammals. Due to recent advances in sequencing technologies, we were able to perform in-depth whole genome analyses of representatives of these three dietary groups.


The association of genetic alterations with response rate in newly diagnosed chronic myeloid leukemia patients.

  • Hyunkyung Park‎ et al.
  • Leukemia research‎
  • 2022‎

Genetic differences may be associated with the response to tyrosine kinase inhibitor (TKI) in patients with chronic myeloid leukemia (CML). In this study, we identified genetic alterations between rapid and slow responders (BCR/ABL1 International Scale at 6 months: ≤0.1 % vs. > 0.1 %) of TKI treatment in chronic phase CML patients. Our analyses involved single nucleotide polymorphism (SNP), a Genome Wide Association Study and a Network-wide Association Study (NetWAS). Seventy-two patients from 16 institutions were enrolled and treated with a TKI, nilotinib. Gene Set Analysis identified genetic alterations in pathways related to the differentiation, proliferation, and activity of various innate immune cells. The NetWAS analysis found that genes associated with natural killer (NK) cells (PTPRCAP, BLNK, HCK, ARHGEF11, GPR183, TRPV2, SHKBP1, CD2) showed significant differences between rapid and slow responders of nilotinib. However, we found no significantly different genetic alterations according to the response in the SNP analysis. In conclusion, we found that rapidity of response to TKI was associated with pathway-associated genetic alterations in immune cells, particularly with respect to NK cell activity. These results suggested that the innate immune system at initial diagnosis had an important role in treatment response in patients with CML.


Characterization of HIV-1 recombinant and subtype B near full-length genome among men who have sex with men in South Korea.

  • Sangmi Ryou‎ et al.
  • Scientific reports‎
  • 2021‎

In Korea, subtype B is the predominant variant of HIV-1, but full genome sequencing and analysis of its viral variants are lacking. We performed near full-length genome (NFLG) sequencing and phylogenetic and recombination analyses of fifty plasma samples from HIV-positive men who have sex with men (MSM) from a Korea HIV/AIDS cohort study. Viral genomes were amplified and the near-full-length sequences were determined using next-generation sequencing (NGS) and Sanger sequencing. We focused on the HIV-1 subtype classification and identification of HIV recombinants. Twelve HIV-1 NFLGs were determined: ten were subtyped as pure HIV-1 subtype B and two recombinant strains as a common subtype CRF07_BC, and a novel subtype CRF43_02G recombined with CRF02_AG again, or a new CRF02_AG and subtype G recombinant. For the ten NFLGs determined by NGS, "the novel recombinant emerged at approximately 2003 and the other nine subtype B about 2004 or 2005". This is the first report analyzing HIV-1 NFLG, including recombinants and clinical characteristics, by subtype among MSM in Korea. Our results provide novel insights for understanding the recombinants in the HIV-1 epidemic in Korea.


A streamlined pipeline based on HmmUFOtu for microbial community profiling using 16S rRNA amplicon sequencing.

  • Hyeonwoo Kim‎ et al.
  • Genomics & informatics‎
  • 2023‎

Microbial community profiling using 16S rRNA amplicon sequencing allows for taxonomic characterization of diverse microorganisms. While amplicon sequence variant (ASV) methods are increasingly favored for their fine-grained resolution of sequence variants, they often discard substantial portions of sequencing reads during quality control, particularly in datasets with large number samples. We present a streamlined pipeline that integrates FastP for read trimming, HmmUFOtu for operational taxonomic units (OTU) clustering, Vsearch for chimera checking, and Kraken2 for taxonomic assignment. To assess the pipeline's performance, we reprocessed two published stool datasets of normal Korean populations: one with 890 and the other with 1,462 independent samples. In the first dataset, HmmUFOtu retained 93.2% of over 104 million read pairs after quality trimming, discarding chimeric or unclassifiable reads, while DADA2, a commonly used ASV method, retained only 44.6% of the reads. Nonetheless, both methods yielded qualitatively similar β-diversity plots. For the second dataset, HmmUFOtu retained 89.2% of read pairs, while DADA2 retained a mere 18.4% of the reads. HmmUFOtu, being a closed-reference clustering method, facilitates merging separately processed datasets, with shared OTUs between the two datasets exhibiting a correlation coefficient of 0.92 in total abundance (log scale). While the first two dimensions of the β-diversity plot exhibited a cohesive mixture of the two datasets, the third dimension revealed the presence of a batch effect. Our comparative evaluation of ASV and OTU methods within this streamlined pipeline provides valuable insights into their performance when processing large-scale microbial 16S rRNA amplicon sequencing data. The strengths of HmmUFOtu and its potential for dataset merging are highlighted.


In silico Identification of SFRP1 as a Hypermethylated Gene in Colorectal Cancers.

  • Jongbum Kim‎ et al.
  • Genomics & informatics‎
  • 2014‎

Aberrant DNA methylation, as an epigenetic marker of cancer, influences tumor development and progression. We downloaded publicly available DNA methylation and gene expression datasets of matched cancer and normal pairs from the Cancer Genome Atlas Data Portal and performed a systematic computational analysis. This study has three aims to screen genes that show hypermethylation and downregulated patterns in colorectal cancers, to identify differentially methylated regions in one of these genes, SFRP1, and to test whether the SFRP genes affect survival or not. Our results show that 31 hypermethylated genes had a negative correlation with gene expression. Among them, SFRP1 had a differentially methylated pattern at each methylation site. We also show that SFRP1 may be a potential biomarker for colorectal cancer survival.


  1. SciCrunch.org Resources

    Welcome to the FDI Lab - SciCrunch.org Resources search. From here you can search through a compilation of resources used by FDI Lab - SciCrunch.org and see how data is organized within our community.

  2. Navigation

    You are currently on the Community Resources tab looking through categories and sources that FDI Lab - SciCrunch.org has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.

  3. Logging in and Registering

    If you have an account on FDI Lab - SciCrunch.org then you can log in from here to get additional features in FDI Lab - SciCrunch.org such as Collections, Saved Searches, and managing Resources.

  4. Searching

    Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:

    1. Use quotes around phrases you want to match exactly
    2. You can manually AND and OR terms to change how we search between words
    3. You can add "-" to terms to make sure no results return with that term in them (ex. Cerebellum -CA1)
    4. You can add "+" to terms to require they be in the data
    5. Using autocomplete specifies which branch of our semantics you with to search and can help refine your search
  5. Save Your Search

    You can save any searches you perform for quick access to later from here.

  6. Query Expansion

    We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.

  7. Collections

    If you are logged into FDI Lab - SciCrunch.org you can add data records to your collections to create custom spreadsheets across multiple sources of data.

  8. Facets

    Here are the facets that you can filter your papers by.

  9. Options

    From here we'll present any options for the literature, such as exporting your current results.

  10. Further Questions

    If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.

Publications Per Year

X

Year:

Count: