This service exclusively searches for literature that cites resources. Please be aware that the total number of searchable documents is limited to those containing RRIDs and does not include all open-access literature.
The realization that cancer progression required the participation of cellular genes provided one of several key rationales, in 1986, for embarking on the human genome project. Only with a reference genome sequence could the full spectrum of somatic changes leading to cancer be understood. Since its completion in 2003, the human reference genome sequence has fulfilled its promise as a foundational tool to illuminate the pathogenesis of cancer. Herein, we review the key historical milestones in cancer genomics since the completion of the genome, and some of the novel discoveries that are shaping our current understanding of cancer.
Although parental genomes undergo extensive epigenetic reprogramming to be equalized after fertilization, whether they play different roles in human zygotic genome activation (ZGA) remains unknown. Here, we mapped parental transcriptomes by using human parthenogenetic (PG) and androgenetic (AG) embryos during ZGA. Our data show that human ZGA is launched at the 8-cell stage in AG and bi-parental embryos, but at the morula stage in PG embryos. In contrast, mouse ZGA occurs at the same stage in PG and AG embryos. Mechanistically, primate-specific ZNF675 with AG-specific expression plays a role in human ZGA initiated from paternal genome at the 8-cell stage. AG-specifically expressed LSM1 is also critical for human maternal RNA degradation (MRD) and ZGA. The allelic expressions of ZNF675 and LSM1 are associated with their allelically epigenetic states. Notably, the paternally specific expressions of ZNF675 and LSM1 are also observed in diploid embryos. Collectively, human ZGA is initiated from paternal genome.
Most gene editing technologies introduce breaks or nicks into DNA, leading to the generation of mutagenic insertions and deletions by non-homologous end-joining repair. Here, we report a new, cleavage-free gene editing approach based on replication interrupted template-driven DNA modification (RITDM). The RITDM system makes use of sequence-specific DLR fusion molecules that are specifically designed to enable localized, temporary blockage of DNA replication fork progression, thereby exposing single-stranded DNA that can be bound by DNA sequence modification templates for precise editing. We evaluate the use of zinc-finger arrays for sequence recognition. We demonstrate that RITDM can be used for gene editing at endogenous genomic loci in human cells and highlight its safety profile of low indel frequencies and undetectable off-target side effects in RITDM-edited clones and pools of cells.
Increasing evidence suggests that interactions between regulatory genomic elements play an important role in regulating gene expression. We generated a genome-wide interaction map of regulatory elements in human cells (ENCODE tier 1 cells, K562, GM12878) using Chromatin Interaction Analysis by Paired-End Tag sequencing (ChIA-PET) experiments targeting six broadly distributed factors. Bound regions covered 80% of DNase I hypersensitive sites including 99.7% of TSS and 98% of enhancers. Correlating this map with ChIP-seq and RNA-seq data sets revealed cohesin, CTCF, and ZNF143 as key components of three-dimensional chromatin structure and revealed how the distal chromatin state affects gene transcription. Comparison of interactions between cell types revealed that enhancer-promoter interactions were highly cell-type-specific. Construction and comparison of distal and proximal regulatory networks revealed stark differences in structure and biological function. Proximal binding events are enriched at genes with housekeeping functions, while distal binding events interact with genes involved in dynamic biological processes including response to stimulus. This study reveals new mechanistic and functional insights into regulatory region organization in the nucleus.
The hookworm Necator americanus is the predominant soil-transmitted human parasite. Adult worms feed on blood in the small intestine, causing iron-deficiency anemia, malnutrition, growth and development stunting in children, and severe morbidity and mortality during pregnancy in women. We report sequencing and assembly of the N. americanus genome (244 Mb, 19,151 genes). Characterization of this first hookworm genome sequence identified genes orchestrating the hookworm's invasion of the human host, genes involved in blood feeding and development, and genes encoding proteins that represent new potential drug targets against hookworms. N. americanus has undergone a considerable and unique expansion of immunomodulator proteins, some of which we highlight as potential treatments against inflammatory diseases. We also used a protein microarray to demonstrate a postgenomic application of the hookworm genome sequence. This genome provides an invaluable resource to boost ongoing efforts toward fundamental and applied postgenomic research, including the development of new methods to control hookworm and human immunological diseases.
The use of custom-engineered sequence-specific nucleases (including CRISPR/Cas9, ZFN, and TALEN) allows genetic changes in human cells to be easily made with much greater efficiency and precision than before. Engineered double-stranded DNA breaks can efficiently disrupt genes, or, with the right donor vector, engineer point mutations and gene insertions. However, a number of design considerations should be taken into account to ensure maximum gene targeting efficiency and specificity. This is especially true when engineering human embryonic stem or induced pluripotent stem cells (iPSCs), which are more difficult to transfect and less resilient to DNA damage than immortalized tumor cell lines. Here, we describe a protocol for easily engineering genetic changes in human iPSCs, through which we typically achieve targeting efficiencies between 1% and 10% without any subsequent selection steps. Since this protocol only uses the simple transient transfection of plasmids and/or single-stranded oligonucleotides, most labs will easily be able to perform it. We also describe strategies for identifying, cloning, and genotyping successfully edited cells, and how to design the optimal sgRNA target sites and donor vectors. Finally, we discuss alternative methods for gene editing including viral delivery vectors, Cas9 nickases, and orthogonal Cas9 systems.
Single-cell genome analyses of human oocytes are important for meiosis research and preimplantation genomic screening. However, the nonuniformity of single-cell whole-genome amplification hindered its use. Here, we demonstrate genome analyses of single human oocytes using multiple annealing and looping-based amplification cycle (MALBAC)-based sequencing technology. By sequencing the triads of the first and second polar bodies (PB1 and PB2) and the oocyte pronuclei from same female egg donors, we phase the genomes of these donors with detected SNPs and determine the crossover maps of their oocytes. Our data exhibit an expected crossover interference and indicate a weak chromatid interference. Further, the genome of the oocyte pronucleus, including information regarding aneuploidy and SNPs in disease-associated alleles, can be accurately deduced from the genomes of PB1 and PB2. The MALBAC-based preimplantation genomic screening in in vitro fertilization (IVF) enables accurate and cost-effective selection of normal fertilized eggs for embryo transfer.
Here we studied one special type of gene, i.e., the nested gene, in the human genome. We collected 373 reliably annotated nested genes. Two-thirds of them were on the strand opposite that of their host gene. About 58% coding nested gene pairs were conserved in mouse and some were even maintained in chicken and fish, while nested pseudogenes were poorly conserved. Ka/Ks analysis revealed that nested genes were under strong selection, although they did not demonstrate greater conservation than other genes. With microarray data we observed that two partners of one nested pair seemed to be expressed reciprocally. A significant proportion of nested genes were tissue-specifically expressed. Gene ontology analysis demonstrated that quite a number of nested genes participated in cellular signal transduction. Based on these observations, we think that nested genes are a group of genes with important physiological functions.
Polymorphic minisatellites, also known as variable number of tandem repeats (VNTRs), are tandem repeat regions that show variation in the number of repeat units among chromosomes in a population. Currently, there are no general methods for predicting which minisatellites have a high probability of being polymorphic, given their sequence characteristics. An earlier approach has focused on potentially highly polymorphic and hypervariable minisatellites, which make up only a small fraction of all minisatellites in the human genome. We have developed a model, based on available minisatellite and VNTR sequence data, that predicts the probability that a minisatellite (unit size > or = 6 bp) identified by the computer program Tandem Repeats Finder is polymorphic (VNTR). According to the model, minisatellites with high copy number and high degree of sequence similarity are most likely to be VNTRs. This approach was used to scan the draft sequence of the human genome for VNTRs. A total of 157,549 minisatellite repeats were found, of which 29,224 are predicted to be VNTRs. Contrary to previous results, VNTRs appear to be widespread and abundant throughout the human genome, with an estimated density of 9.1 VNTRs/Mb.
Contamination in genome assembly can lead to wrong or confusing results when using such genome as reference in sequence comparison. Although bacterial contamination is well known, the problem of human-originated contamination received little attention. In this study we surveyed 45,735 available genome assemblies for evidence of human contamination. We used lineage specificity to distinguish between contamination and conservation. We found that 154 genome assemblies contain fragments that with high confidence originate as contamination from human DNA. Majority of contaminating human sequences were present in the reference human genome assembly for over a decade. We recommend that existing contaminated genomes should be revised to remove contaminated sequence, and that new assemblies should be thoroughly checked for presence of human DNA before submitting them to public databases.
Despite intense investigation, human replication origins and termini remain elusive. Existing data have shown strong discrepancies. Here we sequenced highly purified Okazaki fragments from two cell types and, for the first time, quantitated replication fork directionality and delineated initiation and termination zones genome-wide. Replication initiates stochastically, primarily within non-transcribed, broad (up to 150 kb) zones that often abut transcribed genes, and terminates dispersively between them. Replication fork progression is significantly co-oriented with the transcription. Initiation and termination zones are frequently contiguous, sometimes separated by regions of unidirectional replication. Initiation zones are enriched in open chromatin and enhancer marks, even when not flanked by genes, and often border 'topologically associating domains' (TADs). Initiation zones are enriched in origin recognition complex (ORC)-binding sites and better align to origins previously mapped using bubble-trap than λ-exonuclease. This novel panorama of replication reveals how chromatin and transcription modulate the initiation process to create cell-type-specific replication programs.
High-profile genomic variation projects like the 1000 Genomes project or the Exome Aggregation Consortium, are generating a wealth of human genomic variation knowledge which can be used as an essential reference for identifying disease-causing genotypes. However, accessing these data, contrasting the various studies and integrating those data in downstream analyses remains cumbersome. The Human Genome Variation Archive (HGVA) tackles these challenges and facilitates access to genomic data for key reference projects in a clean, fast and integrated fashion. HGVA provides an efficient and intuitive web-interface for easy data mining, a comprehensive RESTful API and client libraries in Python, Java and JavaScript for fast programmatic access to its knowledge base. HGVA calculates population frequencies for these projects and enriches their data with variant annotation provided by CellBase, a rich and fast annotation solution. HGVA serves as a proof-of-concept of the genome analysis developments being carried out by the University of Cambridge together with UK's 100 000 genomes project and the National Institute for Health Research BioResource Rare-Diseases, in particular, deploying open-source for Computational Biology (OpenCB) software platform for storing and analyzing massive genomic datasets.
The most widely used human genome reference assembly hg19 harbors minor alleles at 2.18 million positions as revealed by 1000 Genome Phase 3 dataset. Although this is less than 2% of the 89 million variants reported, it has been shown that the minor alleles can result in 30% false positives in individual genomes, thus misleading and burdening downstream interpretation. More alarming is the fact that, significant percentage of variants that are homozygous recessive for these minor alleles, with potential disease implications, are masked from reporting.
Gene duplicates generated via retroposition were long thought to be pseudogenized and consequently decayed. However, a significant number of these genes escaped their evolutionary destiny and evolved into functional genes. Despite multiple studies, the number of functional retrogenes in human and other genomes remains unclear. We performed a comparative analysis of human, chicken, and worm genomes to identify "orphan" retrogenes, that is, retrogenes that have replaced their progenitors. We located 25 such candidates in the human genome. All of these genes were previously known, and the majority has been intensively studied. Despite this, they have never been recognized as retrogenes. Analysis revealed that the phenomenon of replacing parental genes with their retrocopies has been taking place over the entire span of animal evolution. This process was often species specific and contributed to interspecies differences. Surprisingly, these retrogenes, which should evolve in a more relaxed mode, are subject to a very strong purifying selection, which is, on average, two and a half times stronger than other human genes. Also, for retrogenes, they do not show a typical overall tendency for a testis-specific expression. Notably, seven of them are associated with human diseases. Recognizing them as "orphan" retrocopies, which have different regulatory machinery than their parents, is important for any disease studies in model organisms, especially when discoveries made in one species are transferred to humans.
The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is an expertly curated database of literature-derived functional information for the model organism budding yeast, Saccharomyces cerevisiae. SGD constantly strives to synergize new types of experimental data and bioinformatics predictions with existing data, and to organize them into a comprehensive and up-to-date information resource. The primary mission of SGD is to facilitate research into the biology of yeast and to provide this wealth of information to advance, in many ways, research on other organisms, even those as evolutionarily distant as humans. To build such a bridge between biological kingdoms, SGD is curating data regarding yeast-human complementation, in which a human gene can successfully replace the function of a yeast gene, and/or vice versa. These data are manually curated from published literature, made available for download, and incorporated into a variety of analysis tools provided by SGD.
Characterizing large genomic variants is essential to expanding the research and clinical applications of genome sequencing. While multiple data types and methods are available to detect these structural variants (SVs), they remain less characterized than smaller variants because of SV diversity, complexity, and size. These challenges are exacerbated by the experimental and computational demands of SV analysis. Here, we characterize the SV content of a personal genome with Parliament, a publicly available consensus SV-calling infrastructure that merges multiple data types and SV detection methods.
CRISPR guide RNA libraries have been iteratively improved to provide increasingly efficient reagents, although their large size is a barrier for many applications. We design an optimised minimal genome-wide human CRISPR-Cas9 library (MinLibCas9) by mining existing large-scale gene loss-of-function datasets, resulting in a greater than 42% reduction in size compared to other CRISPR-Cas9 libraries while preserving assay sensitivity and specificity. MinLibCas9 provides backward compatibility with existing datasets, increases the dynamic range of CRISPR-Cas9 screens and extends their application to complex models and assays.
The sequence of the human genome provides a scaffold on which numerous annotations, such the locations of genes, can be laid. Genome browsers have been created to allow the simultaneous display of multiple annotations within a graphical interface. In addition, they provide the ability to search for markers and sequences, to extract annotations for specific regions or for the whole genome and to act as a central starting point for genomic research. This review describes the basic functionality of genome browsers and compares three of them: the University of California Santa Cruz (UCSC) Genome Browser, the Ensembl Genome Browser and the NCBI MapViewer.
Significant advances have been made over the past 5 years in mapping and characterizing structural variation in the human genome. Despite this progress, our understanding of inversion variants is still very restricted. While unbalanced variants such as copy number variations can be mapped using array-based approaches, strategies for characterization of inversion variants have been limited and underdeveloped. Traditional cytogenetic approaches have long been able to identify microscopic inversion events, but discovery of submicroscopic events has remained elusive and largely ignored. With the advent of paired-end sequencing approaches, it is now possible to map inversions across the human genome. Based on the paired-end sequencing studies published to date, it is now feasible to make a first map of inversions across the human genome and to use this map to explore the characteristics and distribution of this form of variation. The current map of inversions indicates that many remain to be identified, especially in the smaller size ranges. This review provides an overview of the current knowledge about human inversions and their contribution to human phenotypes. Further characterization of inversions should be considered as an important step towards a deeper understanding of human variation and genome dynamics.
Welcome to the FDI Lab - SciCrunch.org Resources search. From here you can search through a compilation of resources used by FDI Lab - SciCrunch.org and see how data is organized within our community.
You are currently on the Community Resources tab looking through categories and sources that FDI Lab - SciCrunch.org has compiled. You can navigate through those categories from here or change to a different tab to execute your search through. Each tab gives a different perspective on data.
If you have an account on FDI Lab - SciCrunch.org then you can log in from here to get additional features in FDI Lab - SciCrunch.org such as Collections, Saved Searches, and managing Resources.
Here is the search term that is being executed, you can type in anything you want to search for. Some tips to help searching:
You can save any searches you perform for quick access to later from here.
We recognized your search term and included synonyms and inferred terms along side your term to help get the data you are looking for.
If you are logged into FDI Lab - SciCrunch.org you can add data records to your collections to create custom spreadsheets across multiple sources of data.
Here are the facets that you can filter your papers by.
From here we'll present any options for the literature, such as exporting your current results.
If you have any further questions please check out our FAQs Page to ask questions and see our tutorials. Click this button to view this tutorial again.
Year:
Count: