Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Aberrant splicing isoforms detected by full-length transcriptome sequencing as transcripts of potential neoantigens in non-small cell lung cancer.

Genome biology | 2021

Long-read sequencing of full-length cDNAs enables the detection of structures of aberrant splicing isoforms in cancer cells. These isoforms are occasionally translated, presented by HLA molecules, and recognized as neoantigens. This study used a long-read sequencer (MinION) to construct a comprehensive catalog of aberrant splicing isoforms in non-small-cell lung cancers, by which novel isoforms and potential neoantigens are identified.

Pubmed ID: 33397462 RIS Download

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


GATK (tool)

RRID:SCR_001876

A software package to analyze next-generation resequencing data. The toolkit offers a wide variety of tools, with a primary focus on variant discovery and genotyping as well as strong emphasis on data quality assurance. Its robust architecture, powerful processing engine and high-performance computing features make it capable of taking on projects of any size. This software library makes writing efficient analysis tools using next-generation sequencing data very easy, and second it's a suite of tools for working with human medical resequencing projects such as 1000 Genomes and The Cancer Genome Atlas. These tools include things like a depth of coverage analyzers, a quality score recalibrator, a SNP/indel caller and a local realigner. (entry from Genetic Analysis Software)

View all literature mentions

COSMIC - Catalogue Of Somatic Mutations In Cancer (tool)

RRID:SCR_002260

Database to store and display somatic mutation information and related details and contains information relating to human cancers. The mutation data and associated information is extracted from the primary literature. In order to provide a consistent view of the data a histology and tissue ontology has been created and all mutations are mapped to a single version of each gene. The data can be queried by tissue, histology or gene and displayed as a graph, as a table or exported in various formats.
Some key features of COSMIC are:
* Contains information on publications, samples and mutations. Includes samples which have been found to be negative for mutations during screening therefore enabling frequency data to be calculated for mutations in different genes in different cancer types.
* Samples entered include benign neoplasms and other benign proliferations, in situ and invasive tumours, recurrences, metastases and cancer cell lines.

View all literature mentions

dbSNP (tool)

RRID:SCR_002338

Database as central repository for both single base nucleotide substitutions and short deletion and insertion polymorphisms. Distinguishes report of how to assay SNP from use of that SNP with individuals and populations. This separation simplifies some issues of data representation. However, these initial reports describing how to assay SNP will often be accompanied by SNP experiments measuring allele occurrence in individuals and populations. Community can contribute to this resource.

View all literature mentions

Cytoscape (tool)

RRID:SCR_003032

Software platform for complex network analysis and visualization. Used for visualization of molecular interaction networks and biological pathways and integrating these networks with annotations, gene expression profiles and other state data.

View all literature mentions

RefSeq (tool)

RRID:SCR_003496

Collection of curated, non-redundant genomic DNA, transcript RNA, and protein sequences produced by NCBI. Provides a reference for genome annotation, gene identification and characterization, mutation and polymorphism analysis, expression studies, and comparative analyses. Accessed through the Nucleotide and Protein databases.

View all literature mentions

Systems Transcriptional Activity Reconstruction (tool)

RRID:SCR_005622

A next-generation web-based application that aims to provide an integrated solution for both visualization and analysis of deep-sequencing data, along with simple access to public datasets.

View all literature mentions

Promega (tool)

RRID:SCR_006724

An Antibody supplier

View all literature mentions

Variant Effect Predictor (tool)

RRID:SCR_007931

Data analysis service to predict the functional consequences of known and unknown variants.

View all literature mentions

WEBLOGO (tool)

RRID:SCR_010236

Web application to generate sequence logos, graphical representations of patterns within multiple sequence alignment. Designed to make generation of sequence logos easy. Sequence logo generator.

View all literature mentions

Trimmomatic (tool)

RRID:SCR_011848

Software Java pipeline for trimming tasks for Illumina paired end and single ended data. Flexible Trimmer for Illumina Sequence Data. Pair aware preprocessing tool optimized for Illumina next generation sequencing data. Includes several processing steps for read trimming and filtering. Operating systems Unix/Linux, Mac OS, Windows.

View all literature mentions

BLAT (tool)

RRID:SCR_011919

Software designed to quickly find sequences of 95% and greater similarity of length 25 bases or more.

View all literature mentions

RepeatMasker (tool)

RRID:SCR_012954

Software tool that screens DNA sequences for interspersed repeats and low complexity DNA sequences. The output of the program is a detailed annotation of the repeats that are present in the query sequence as well as a modified version of the query sequence in which all the annotated repeats have been masked (default: replaced by Ns). Currently over 56% of human genomic sequence is identified and masked by the program. Sequence comparisons in RepeatMasker are performed by one of several popular search engines including nhmmer, cross_match, ABBlast/WUBlast, RMBlast and Decypher. RepeatMasker makes use of curated libraries of repeats and currently supports Dfam ( profile HMM library ) and RepBase ( consensus sequence library ).

View all literature mentions

RSEM (tool)

RRID:SCR_013027

Software package for quantifying gene and isoform abundances from single end or paired end RNA Seq data. Accurate transcript quantification from RNA Seq data with or without reference genome. Used for accurate quantification of gene and isoform expression from RNA-Seq data.

View all literature mentions

Agilent Technologies (tool)

RRID:SCR_013575

Company provides laboratories worldwide with analytical instruments and supplies, clinical and diagnostic testing services, consumables, applications and expertise in life sciences and applied chemical markets.

View all literature mentions

Mascot (tool)

RRID:SCR_014322

A software package and server used to identify and characterize proteins from primary sequence databases using mass spectrometry data. Mascot integrates peptide mass fingerprinting, sequence querying, and MS/MS ion searching in order to search for proteins in databases like SwissProt, NCBInr, EMBL EST divisions, contaminants, and cRAP. If a license is purchased, users may: search data sets that exceed the 1200 spectrum limit of the free version; set up automated, high throughput work; add and edit proteins and quantification methods; and search a preferred collection of sequence databases. The software package works with instruments from AB Sciex, Agilent, Bruker, Jeol, Shimadzu, Thermo Scientific, and Waters.

View all literature mentions

MaxQuant (tool)

RRID:SCR_014485

A quantitative proteomics software package for analyzing large-scale mass-spectrometric data sets. It is a set of algorithms that include peak detection and scoring of peptides, mass calibration, database searches for protein identification, protein quantification, and provides summary statistics.

View all literature mentions

GENCODE (tool)

RRID:SCR_014966

Human and mouse genome annotation project which aims to identify all gene features in the human genome using computational analysis, manual annotation, and experimental validation.

View all literature mentions

Albacore (tool)

RRID:SCR_015897

Data processing basecaller for the Oxford Nanopore sequencer that identifies DNA sequences directly from raw data. It enhances accuracy of the single-read sequence data, contributing to high consensus accuracy for nanopore sequence data.

View all literature mentions

Minimap2 (tool)

RRID:SCR_018550

Software tool as pairwise alignment for nucleotide sequences. Alignment program to map DNA or long mRNA sequences against large reference database. Versatile pairwise aligner for genomic and spliced nucleotide sequences.

View all literature mentions

Genomic Data Commons Data Portal (GDC Data Portal) (tool)

RRID:SCR_014514

A unified data repository of the National Cancer Institute (NCI)'s Genomic Data Commons (GDC) that enables data sharing across cancer genomic studies in support of precision medicine. The GDC supports several cancer genome programs at the NCI Center for Cancer Genomics (CCG), including The Cancer Genome Atlas (TCGA), Therapeutically Applicable Research to Generate Effective Treatments (TARGET), and the Cancer Genome Characterization Initiative (CGCI). The GDC Data Portal provides a platform for efficiently querying and downloading high quality and complete data. The GDC also provides a GDC Data Transfer Tool and a GDC API for programmatic access.

View all literature mentions

Seqtk (tool)

RRID:SCR_018927

Software fast and lightweight tool for processing sequences in FASTA or FASTQ format.

View all literature mentions

A-549 (tool)

RRID:CVCL_0023

Cell line A-549 is a Cancer cell line with a species of origin Homo sapiens (Human)

View all literature mentions

PC-3 (tool)

RRID:CVCL_0035

Cell line PC-3 is a Cancer cell line with a species of origin Homo sapiens (Human)

View all literature mentions

PC-9 (tool)

RRID:CVCL_B260

Cell line PC-9 is a Cancer cell line with a species of origin Homo sapiens (Human)

View all literature mentions

NCI-H1299 (tool)

RRID:CVCL_0060

Cell line NCI-H1299 is a Cancer cell line with a species of origin Homo sapiens (Human)

View all literature mentions