TCRGenesis: Generation of SIINFEKL-Specific T-Cell Receptor Sequences Using Autoregressive Transformer
Abstract
Engineered T-cell therapies are a promising new approach for treating previously uncurable diseases. These therapies involve genetically modified T cells expressing custom T cell receptors (TCRs) that recognize antigens from cancer, virus-infected, or autoimmune cells. However, the identification or generation of suitable TCRs remains an unsolved challenge. Computational methods hold the potential to accelerate the development of TCRs binding towards target antigens. While the computational investigation of the TCR-epitope landscape has been mainly focused on binding prediction, synthetic TCR design has recently emerged as the next frontier. Here, we present a proof-of-concept study on generating full TCR sequences reactive to a fixed epitope $\textit{in silico}$. Towards this, we utilized a unique dataset comprising thousands of TCRs experimentally validated as reactive towards the model epitope-MHC complex SIINFEKL/H2-K$^b$ and a naive TCR background to train our autoregressive transformer model TCRGenesis. The model generated a repertoire of realistic TCRs as validated through various biophysical and sequence properties. Further, the sequences exhibited high binding scores according to a predictor specifically developed for evaluation. The generator inherently captured the rules governing binding towards SIINFEKL as its perplexity score assigned to real, unseen TCR sequences separates well between binding and non-binding TCRs, and the generated sequences resembled binders. This work marks one of the first steps in the full-sequence design of TCRs specific to an antigen $\textit{in silico}$, which we envision will accelerate the development of future immunotherapies and personalized medicine through rapid and reliable TCR synthesis.
Cite
Text
An et al. "TCRGenesis: Generation of SIINFEKL-Specific T-Cell Receptor Sequences Using Autoregressive Transformer." NeurIPS 2024 Workshops: AIDrugX, 2024.Markdown
[An et al. "TCRGenesis: Generation of SIINFEKL-Specific T-Cell Receptor Sequences Using Autoregressive Transformer." NeurIPS 2024 Workshops: AIDrugX, 2024.](https://mlanthology.org/neuripsw/2024/an2024neuripsw-tcrgenesis/)BibTeX
@inproceedings{an2024neuripsw-tcrgenesis,
title = {{TCRGenesis: Generation of SIINFEKL-Specific T-Cell Receptor Sequences Using Autoregressive Transformer}},
author = {An, Yang and Drost, Felix and Straub, Adrian and Marsico, Annalisa and Busch, Dirk H and Schubert, Benjamin},
booktitle = {NeurIPS 2024 Workshops: AIDrugX},
year = {2024},
url = {https://mlanthology.org/neuripsw/2024/an2024neuripsw-tcrgenesis/}
}