Publication

Transcriptional and Mutational Profiling of B-Other Acute Lymphoblastic Leukemia for Improved Diagnostics.

Cancers | 2021

B-cell precursor acute lymphoblastic leukemia (BCP-ALL) is the most common cancer in children, and significant progress has been made in diagnostics and the treatment of this disease based on the subtypes of BCP-ALL. However, in a large proportion of cases (B-other), recurrent BCP-ALL-associated genomic alterations remain unidentifiable by current diagnostic procedures. In this study, we performed RNA sequencing and analyzed gene fusions, expression profiles, and mutations in diagnostic samples of 185 children with BCP-ALL. Gene expression clustering showed that a subset of B-other samples partially clusters with some of the known subgroups, particularly DUX4-positive. Mutation analysis coupled with gene expression profiling revealed the presence of distinctive BCP-ALL subgroups, characterized by the presence of mutations in known ALL driver genes, e.g., PAX5 and IKZF1. Moreover, we identified novel fusion partners of lymphoid lineage transcriptional factors ETV6, IKZF1 and PAX5. In addition, we report on low blast count detection thresholds and show that the use of EDTA tubes for sample collection does not have adverse effects on sequencing and downstream analysis. Taken together, our findings demonstrate the applicability of whole-transcriptome sequencing for personalized diagnostics in pediatric ALL, including tentative classification of the B-other cases that are difficult to diagnose using conventional methods.

PubMed ID: 34830809

Research resources used in this publication

None found

Additional research tools detected in this publication

Antibodies used in this publication

None found

Associated grants

None

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.

GATK (tool)

RRID:SCR_001876

A software package to analyze next-generation resequencing data. The toolkit offers a wide variety of tools, with a primary focus on variant discovery and genotyping as well as a strong emphasis on data quality assurance. Its robust architecture, powerful processing engine and high-performance computing features make it capable of taking on projects of any size. It is both a software library that makes it easy to write efficient analysis tools for next-generation sequencing data and a suite of tools for working with human medical resequencing projects such as 1000 Genomes and The Cancer Genome Atlas. These tools include depth-of-coverage analyzers, a quality score recalibrator, a SNP/indel caller and a local realigner. (entry from Genetic Analysis Software)

Gene Set Enrichment Analysis (tool)

RRID:SCR_003199

Software package for interpreting gene expression data. Used for interpretation of a large-scale experiment by identifying pathways and processes.

ExAC (tool)

RRID:SCR_004068

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on January 9, 2023. An aggregated data platform for genome sequencing data created by a coalition of investigators seeking to aggregate and harmonize exome sequencing data from a variety of large-scale sequencing projects, and to make summary data available for the wider scientific community. The data set provided on this website spans 61,486 unrelated individuals sequenced as part of various disease-specific and population genetic studies. They have removed individuals affected by severe pediatric disease, so this data set should serve as a useful reference set of allele frequencies for severe disease studies. All of the raw data from these projects have been reprocessed through the same pipeline, and jointly variant-called to increase consistency across projects. They ask that you not publish global (genome-wide) analyses of these data until after the ExAC flagship paper has been published, estimated to be in early 2015. If you're uncertain which category your analyses fall into, please email them. The aggregation and release of summary data from the exomes collected by the Exome Aggregation Consortium has been approved by the Partners IRB (protocol 2013P001477, Genomic approaches to gene discovery in rare neuromuscular diseases).

STAR (tool)

RRID:SCR_004463

Software performing alignment of high-throughput RNA-seq data. Aligns RNA-seq reads to reference genome using uncompressed suffix arrays.

EDASeq (tool)

RRID:SCR_006751

Software for numerical and graphical summaries of RNA-Seq read data. Within-lane normalization procedures to adjust for GC-content effect (or other gene-level effects) on read counts: loess robust local regression, global-scaling, and full-quantile normalization (Risso et al., 2011). Between-lane normalization procedures to adjust for distributional differences between lanes (e.g., sequencing depth): global-scaling and full-quantile normalization (Bullard et al., 2010).
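
The two between-lane procedures named above can be illustrated with a minimal sketch. This is not EDASeq's own code (EDASeq is an R/Bioconductor package); the function names `global_scale` and `full_quantile_normalize` are hypothetical, the scaling factor is assumed here to be the total lane count (other summaries such as the median or upper quartile are also common), and ties are broken by input order.

```python
def global_scale(counts):
    """Scale each lane (column) so that its total equals the mean lane total.

    counts: list of rows (one per gene), each a list of per-lane read counts.
    Assumption: total count is used as the scaling summary.
    """
    n_lanes = len(counts[0])
    totals = [sum(row[j] for row in counts) for j in range(n_lanes)]
    target = sum(totals) / n_lanes
    return [[row[j] * target / totals[j] for j in range(n_lanes)] for row in counts]


def full_quantile_normalize(counts):
    """Force every lane to share the same empirical distribution.

    The k-th smallest value in each lane is replaced by the mean of the
    k-th smallest values across all lanes.
    """
    n_genes = len(counts)
    n_lanes = len(counts[0])
    # Column-wise view: one list of values per lane.
    lanes = [[row[j] for row in counts] for j in range(n_lanes)]
    sorted_lanes = [sorted(lane) for lane in lanes]
    # Reference distribution: mean of each rank across lanes.
    reference = [sum(s[k] for s in sorted_lanes) / n_lanes for k in range(n_genes)]
    normalized = [[0.0] * n_lanes for _ in range(n_genes)]
    for j, lane in enumerate(lanes):
        # Gene indices ordered from smallest to largest count in this lane.
        order = sorted(range(n_genes), key=lambda i: lane[i])
        for rank, gene in enumerate(order):
            normalized[gene][j] = reference[rank]
    return normalized
```

After full-quantile normalization every lane contains exactly the same set of values, only assigned to different genes, which is why it removes distributional differences such as sequencing depth.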

1000 Genomes: A Deep Catalog of Human Genetic Variation (tool)

RRID:SCR_006828

International collaboration producing an extensive public catalog of human genetic variation, including SNPs and structural variants, and their haplotype contexts, in an effort to provide a foundation for investigating the relationship between genotype and phenotype. The genomes of about 2500 unidentified people from about 25 populations around the world were sequenced using next-generation sequencing technologies. Redundant sequencing on various platforms and by different groups of scientists of the same samples can be compared. The results of the study are freely and publicly accessible to researchers worldwide. The consortium identified the following populations whose DNA will be sequenced: Yoruba in Ibadan, Nigeria; Japanese in Tokyo; Chinese in Beijing; Utah residents with ancestry from northern and western Europe; Luhya in Webuye, Kenya; Maasai in Kinyawa, Kenya; Toscani in Italy; Gujarati Indians in Houston; Chinese in metropolitan Denver; people of Mexican ancestry in Los Angeles; and people of African ancestry in the southwestern United States. The goal of the Project is to find most genetic variants that have frequencies of at least 1% in the populations studied. Sequencing is still too expensive to deeply sequence the many samples being studied for this project. However, any particular region of the genome generally contains a limited number of haplotypes. Data can be combined across many samples to allow efficient detection of most of the variants in a region. The Project currently plans to sequence each sample to about 4X coverage; at this depth sequencing cannot provide the complete genotype of each sample, but should allow the detection of most variants with frequencies as low as 1%. Combining the data from 2500 samples should allow highly accurate estimation (imputation) of the variants and genotypes for each sample that were not seen directly by the light sequencing.
All samples from the 1000 Genomes Project are available as lymphoblastoid cell lines (LCLs) and LCL-derived DNA from the Coriell Cell Repository as part of the NHGRI Catalog. The sequence and alignment data generated by the 1000 Genomes Project are made available as quickly as possible via its mirrored FTP sites: ftp://ftp.1000genomes.ebi.ac.uk and ftp://ftp-trace.ncbi.nlm.nih.gov/1000genomes
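
The reasoning about 4X coverage can be made concrete with a back-of-the-envelope sketch. It is an illustration only, not the Project's method: it assumes read depth at a site is Poisson-distributed, that a heterozygous carrier contributes the variant allele in half of the reads, and that a single variant read counts as "seen" (real variant callers apply error models and stricter thresholds). The function names are hypothetical.

```python
import math


def prob_allele_unseen(mean_coverage, allele_fraction=0.5):
    """Probability that no read carries the variant allele in one carrier.

    Assumes variant-read count ~ Poisson(mean_coverage * allele_fraction).
    """
    return math.exp(-mean_coverage * allele_fraction)


def prob_seen_in_any(mean_coverage, n_carriers, allele_fraction=0.5):
    """Probability that at least one of n_carriers shows a variant read,
    treating carriers as independent."""
    return 1.0 - prob_allele_unseen(mean_coverage, allele_fraction) ** n_carriers


if __name__ == "__main__":
    # One heterozygous sample at 4X: the variant is missed about 13.5% of the time.
    print(prob_allele_unseen(4))
    # A 1% variant in 2500 samples (5000 chromosomes) has about 50 copies;
    # treating them as 50 heterozygous carriers, it is almost surely observed somewhere.
    print(prob_seen_in_any(4, 50))
```

Under these assumptions a single sample's genotype is unreliable at 4X, while pooling evidence across carriers makes detection of a 1% variant nearly certain, which is the trade-off the description outlines.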

Abbott Diagnostics (tool)

RRID:SCR_008392

Company offering a broad range of instrument systems and diagnostic tests for hospitals, reference labs, blood banks, physician offices and clinics to aid in the diagnosis of a range of serious health issues such as infectious diseases, cancer, and diabetes, as well as monitor other important indicators of health.

QIAGEN (tool)

RRID:SCR_008539

A commercial organization which provides assay technologies to isolate DNA, RNA, and proteins from any biological sample. Assay technologies are then used to make specific target biomolecules, such as the DNA of a specific virus, visible for subsequent analysis.

Condel (tool)

RRID:SCR_008584

A method to assess the outcome of nonsynonymous SNVs using a consensus deleteriousness score that combines the output of various tools (e.g., SIFT, PolyPhen-2, MutationAssessor).

Sigma-Aldrich (tool)

RRID:SCR_008988

American chemical, life science and biotechnology company owned by Merck KGaA. Merger of Sigma Chemical Company and Aldrich Chemical Company. Provides organic and inorganic chemicals, building blocks, reagents, advanced materials and stable isotopes for chemical synthesis, medicinal chemistry and materials science, antibiotics, buffers, carbohydrates, enzymes, forensic tools, hematology and histology, nucleotides, proteins, peptides, amino acids and their derivatives.

SIFT (tool)

RRID:SCR_012813

Data analysis service to predict whether an amino acid substitution affects protein function based on sequence homology and the physical properties of amino acids. SIFT can be applied to naturally occurring nonsynonymous polymorphisms and laboratory-induced missense mutations. A web service is also available. (entry from Genetic Analysis Software)

RSEM (tool)

RRID:SCR_013027

Software package for accurate quantification of gene and isoform abundances from single-end or paired-end RNA-Seq data, with or without a reference genome.

DESeq2 (tool)

RRID:SCR_015687

Software package for differential gene expression analysis based on the negative binomial distribution. Used for analyzing RNA-seq data for differential analysis of count data, using shrinkage estimation for dispersions and fold changes to improve stability and interpretability of estimates.

maftools (tool)

RRID:SCR_024519

R software package offering a multitude of analysis and visualization modules that are commonly used in cancer genomic studies, including driver gene identification, pathway, signature, enrichment, and association analyses. Maftools requires somatic variants in Mutation Annotation Format (MAF) and is independent of larger alignment files.
