Searching across hundreds of databases

Our searching services are busy right now. Your search will reload in five seconds.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

X
Forgot Password

If you have forgotten your password you can enter your email here and get a temporary password sent to your email.

Global analysis of shared T cell specificities in human non-small cell lung cancer enables HLA inference and antigen discovery.

Immunity | 2021

To identify disease-relevant T cell receptors (TCRs) with shared antigen specificity, we analyzed 778,938 TCRβ chain sequences from 178 non-small cell lung cancer patients using the GLIPH2 (grouping of lymphocyte interactions with paratope hotspots 2) algorithm. We identified over 66,000 shared specificity groups, of which 435 were clonally expanded and enriched in tumors compared to adjacent lung. The antigenic epitopes of one such tumor-enriched specificity group were identified using a yeast peptide-HLA A∗02:01 display library. These included a peptide from the epithelial protein TMEM161A, which is overexpressed in tumors and cross-reactive epitopes from Epstein-Barr virus and E. coli. Our findings suggest that this cross-reactivity may underlie the presence of virus-specific T cells in tumor infiltrates and that pathogen cross-reactivity may be a feature of multiple cancers. The approach and analytical pipelines generated in this work, as well as the specificity groups defined here, present a resource for understanding the T cell response in cancer.

Pubmed ID: 33691136 RIS Download

Antibodies used in this publication

None found

Associated grants

  • Agency: NIAID NIH HHS, United States
    Id: R01 AI103867
  • Agency: NCRR NIH HHS, United States
    Id: S10 RR025468
  • Agency: NIAID NIH HHS, United States
    Id: U19 AI057229
  • Agency: NCI NIH HHS, United States
    Id: U54 CA232568
  • Agency: NCI NIH HHS, United States
    Id: P50 CA070907
  • Agency: NCRR NIH HHS, United States
    Id: S10 RR027431
  • Agency: NCI NIH HHS, United States
    Id: R01 CA234629
  • Agency: NCI NIH HHS, United States
    Id: P30 CA124435
  • Agency: NCRR NIH HHS, United States
    Id: S10 RR025518

Publication data is provided by the National Library of Medicine ® and PubMed ®. Data is retrieved from PubMed ® on a weekly schedule. For terms and conditions see the National Library of Medicine Terms and Conditions.

This is a list of tools and resources that we have found mentioned in this publication.


ATCC (tool)

RRID:SCR_001672

Global nonprofit biological resource center (BRC) and research organization that provides biological products, technical services and educational programs to private industry, government and academic organizations. Its mission is to acquire, authenticate, preserve, develop and distribute biological materials, information, technology, intellectual property and standards for the advancement and application of scientific knowledge. The primary purpose of ATCC is to use its resources and experience as a BRC to become the world leader in standard biological reference materials management, intellectual property resource management and translational research as applied to biomaterial development, standardization and certification. ATCC characterizes cell lines, bacteria, viruses, fungi and protozoa, as well as develops and evaluates assays and techniques for validating research resources and preserving and distributing biological materials to the public and private sector research communities.

View all literature mentions

GeneCopoeia (tool)

RRID:SCR_003145

Commercial organization which provides reagents and services for molecular biology research. Its services include clone collections, microRNA solutions, genome editing, qPCR products, and fluorescent labeling and detection.

View all literature mentions

HTSeq (tool)

RRID:SCR_005514

THIS RESOURCE IS NO LONGER IN SERVICE. Documented on February 28,2023. Software Python package that provides infrastructure to process data from high-throughput sequencing assays. While the main purpose of HTSeq is to allow you to write your own analysis scripts, customized to your needs, there are also a couple of stand-alone scripts for common tasks that can be used without any Python knowledge.

View all literature mentions

UniParc (tool)

RRID:SCR_005818

Database that contains publicly available protein sequences with stable and unique identifiers (UPI) which are never removed, changed or reassigned. UniParc tracks sequence changes in the source databases and archives the history of all changes. Information other than protein sequence must be retrieved from the UniParc source databases using the database cross-references.

View all literature mentions

Bioconductor (tool)

RRID:SCR_006442

Software repository for R packages related to analysis and comprehension of high throughput genomic data. Uses separate set of commands for installation of packages. Software project based on R programming language that provides tools for analysis and comprehension of high throughput genomic data.

View all literature mentions

Promega (tool)

RRID:SCR_006724

An Antibody supplier

View all literature mentions

Decombinator (tool)

RRID:SCR_006732

Software suite for analysis of T cell receptor repertoire data. Used for fast, efficient analysis of T cell receptor (TcR) repertoire samples, designed to be accessible to those with no previous programming experience.

View all literature mentions

PeproTech (tool)

RRID:SCR_006802

An Antibody supplier

View all literature mentions

Thermo Fisher Scientific (tool)

RRID:SCR_008452

Commercial vendor and service provider of laboratory reagents and antibodies. Supplier of scientific instrumentation, reagents and consumables, and software services.

View all literature mentions

QIAGEN (tool)

RRID:SCR_008539

A commercial organization which provides assay technologies to isolate DNA, RNA, and proteins from any biological sample. Assay technologies are then used to make specific target biomolecules, such as the DNA of a specific virus, visible for subsequent analysis.

View all literature mentions

IMGT - the international ImMunoGeneTics information system (tool)

RRID:SCR_012780

A high-quality integrated knowledge resource specialized in the immunoglobulins (IG) or antibodies, T cell receptors (TR), major histocompatibility complex (MHC) of human and other vertebrate species, and in the immunoglobulin superfamily (IgSF), MHC superfamily (MhcSF) and related proteins of the immune system (RPI) of vertebrates and invertebrates, serving as the global reference in immunogenetics and immunoinformatics. IMGT provides a common access to sequence, genome and structure Immunogenetics data, based on the concepts of IMGT-ONTOLOGY and on the IMGT Scientific chart rules. IMGT works in close collaboration with EBI (Europe), DDBJ (Japan) and NCBI (USA). IMGT consists of sequence databases, genome database, structure database, and monoclonal antibodies database, Web resources and interactive tools.

View all literature mentions

BD Biosciences (tool)

RRID:SCR_013311

An Antibody supplier

View all literature mentions

New England Biolabs (tool)

RRID:SCR_013517

An Antibody supplier

View all literature mentions

Genomic Data Commons Data Portal (GDC Data Portal) (tool)

RRID:SCR_014514

A unified data repository of the National Cancer Institute (NCI)'s Genomic Data Commons (GDC) that enables data sharing across cancer genomic studies in support of precision medicine. The GDC supports several cancer genome programs at the NCI Center for Cancer Genomics (CCG), including The Cancer Genome Atlas (TCGA), Therapeutically Applicable Research to Generate Effective Treatments (TARGET), and the Cancer Genome Characterization Initiative (CGCI). The GDC Data Portal provides a platform for efficiently querying and downloading high quality and complete data. The GDC also provides a GDC Data Transfer Tool and a GDC API for programmatic access.

View all literature mentions

TraCeR (tool)

RRID:SCR_016338

Software application for recovery of T cell receptor (TCR) data from single cell data. Used to reconstruct full-length, paired T cell receptor (TCR) sequences from T lymphocyte single-cell RNA sequence data. Links T cell specificity with functional response by revealing clonal relationships between cells alongside their transcriptional profiles.

View all literature mentions

Stanford University Vincent Coates Foundation Mass Spectrometry Laboratory Core Facility (tool)

RRID:SCR_017801

Core mass spec and proteomic services include open access lab for trained users with GC/MS, LC/MS, high resolution LC/MS, and MALDI-TOF instruments, help with intact protein analysis, targeted quantitation, drug discovery support, pathway analysis, protein interactions, FFPE tissue analysis, both labeled and label-free proteomics, and more. Please contact SUMS to discuss these and other custom projects including new application development.

View all literature mentions

CRAN (tool)

RRID:SCR_003005

Network of ftp and web servers around world that store identical, up to date, versions of code and documentation for R. Package archive network for R programming language.

View all literature mentions

Jurkat (tool)

RRID:CVCL_0065

Cell line Jurkat is a Cancer cell line with a species of origin Homo sapiens (Human)

View all literature mentions

HEK293T (tool)

RRID:CVCL_0063

Cell line HEK293T is a Transformed cell line with a species of origin Homo sapiens (Human)

View all literature mentions

HEK293T (tool)

RRID:CVCL_0063

Cell line HEK293T is a Transformed cell line with a species of origin Homo sapiens (Human)

View all literature mentions

Seurat (software resource)

RRID:SCR_016341

Software as R package designed for QC, analysis, and exploration of single cell RNA-seq data. Enable users to identify and interpret sources of heterogeneity from single cell transcriptomic measurements, and to integrate diverse types of single cell data.

View all literature mentions

BLASTP (service resource)

RRID:SCR_001010

Data analysis service whose programs search protein databases using a protein query. The algorithms used include blastp, psi-blast, phi-blast, and delta-blast.

View all literature mentions

NetMHCpan Server (software resource)

RRID:SCR_018182

Web server for quantitative prediction of peptide binding to any MHC molecule of known sequence using artificial neural networks. Characterizes binding specificity of given major histocompatibility complex molecule and predicts peptide length profile and peptide binding affinity. NetMHCpan 3.0 is improved prediction of binding to MHC class I molecules integrating information from multiple receptor and peptide length data sets. NetMHCpan 4.0 is trained on naturally eluted ligands and on peptide binding affinity data.

View all literature mentions

Monocle3 (software resource)

RRID:SCR_018685

Software analysis toolkit for single cell RNA-seq. Used for single cell RNA-Seq experiments. Unsupervised algorithm that increases temporal resolution of transcriptome dynamics using single-cell RNA-Seq data collected at multiple time points.

View all literature mentions

Strelka (software resource)

RRID:SCR_005109

Software for somatic single nucleotide variant (SNV) and small indel detection from sequencing data of matched tumor-normal samples. The method employs a novel Bayesian approach which represents continuous allele frequencies for both tumor and normal samples, whilst leveraging the expected genotype structure of the normal. This is achieved by representing the normal sample as a mixture of germline variation with noise, and representing the tumor sample as a mixture of the normal sample with somatic variation. A natural consequence of the model structure is that sensitivity can be maintained at high tumor impurity without requiring purity estimates. The method has superior accuracy and sensitivity on impure samples compared to approaches based on either diploid genotype likelihoods or general allele-frequency tests.

View all literature mentions

MuTect (software resource)

RRID:SCR_000559

Software for the reliable and accurate identification of somatic point mutations in next generation sequencing data of cancer genomes.

View all literature mentions

VARSCAN (software resource)

RRID:SCR_006849

A platform-independent, technology-independent software tool for identifying SNPs and indels in massively parallel sequencing of individual and pooled samples. Given data for a single sample, VarScan identifies and filters germline variants based on read counts, base quality, and allele frequency. Given data for a tumor-normal pair, VarScan also determines the somatic status of each variant (Germline, Somatic, or LOH) by comparing read counts between samples. (entry from Genetic Analysis Software)

View all literature mentions

FlowJo (software resource)

RRID:SCR_008520

Software for single-cell flow cytometry analysis. Its functions include management, display, manipulation, analysis and publication of the data stream produced by flow and mass cytometers.

View all literature mentions

IMGT HighV-QUEST (production service resource)

RRID:SCR_018196

Next generation B and T cell sequence alignment and characterization online surface by IMGT. Web portal for immunoglobulin (IG) or antibody and T cell receptor (TR) analysis from NGS high throughput and deep sequencing.

View all literature mentions

Fiji (software resource)

RRID:SCR_002285

Software package as distribution of ImageJ and ImageJ2 together with Java, Java3D and plugins organized into coherent menu structure. Used to assist research in life sciences.

View all literature mentions

Gene Set Enrichment Analysis (software resource)

RRID:SCR_003199

Software package for interpreting gene expression data. Used for interpretation of a large-scale experiment by identifying pathways and processes.

View all literature mentions

htseq-count (software resource)

RRID:SCR_011867

Script distributed with the HT-Seq Python framework for processing RNA-seq or DNA-seq data.

View all literature mentions

Python Programming Language (software resource)

RRID:SCR_008394

Programming language for all operating systems that lets users work more quickly and integrate their systems more effectively. Often compared to Tcl, Perl, Ruby, Scheme or Java. Some of its key distinguishing features include very clear and readable syntax, strong introspection capabilities, intuitive object orientation, natural expression of procedural code, full modularity, exception-based error handling, high level dynamic data types, extensive standard libraries and third party modules for virtually every task, extensions and modules easily written in C, C (or Java for Python, or .NET languages for IronPython), and embeddable within applications as a scripting interface.

View all literature mentions

SAMTOOLS (software resource)

RRID:SCR_002105

Original SAMTOOLS package has been split into three separate repositories including Samtools, BCFtools and HTSlib. Samtools for manipulating next generation sequencing data used for reading, writing, editing, indexing,viewing nucleotide alignments in SAM,BAM,CRAM format. BCFtools used for reading, writing BCF2,VCF, gVCF files and calling, filtering, summarising SNP and short indel sequence variants. HTSlib used for reading, writing high throughput sequencing data.

View all literature mentions

STAR (software resource)

RRID:SCR_004463

Software performing alignment of high-throughput RNA-seq data. Aligns RNA-seq reads to reference genome using uncompressed suffix arrays.

View all literature mentions

R Project for Statistical Computing (software resource)

RRID:SCR_001905

Software environment and programming language for statistical computing and graphics. R is integrated suite of software facilities for data manipulation, calculation and graphical display. Can be extended via packages. Some packages are supplied with the R distribution and more are available through CRAN family.It compiles and runs on wide variety of UNIX platforms, Windows and MacOS.

View all literature mentions

NCI-H1395 (cell line)

RRID:CVCL_1467

Cell line NCI-H1395 is a Cancer cell line with a species of origin Homo sapiens (Human)

View all literature mentions

T2 (cell line)

RRID:CVCL_2211

Cell line T2 is a Hybrid cell line with a species of origin Homo sapiens

View all literature mentions