Multi-Task Training Increases Native Sequence Recovery of Antigen-Specific T-Cell Receptor Sequences

Abstract

T-cells are a critical component of the adaptive immune system that use specialized T-cell receptors (TCRs) to bind non-self peptide fragments presented by major histocompatibility complex (MHC) molecules on the surface of other cells. Given their importance, a foundation model of TCR specificity that is capable of reliably mapping between TCR sequences and their cognate peptide-MHC (pMHC) ligands remains an unmet need. This study presents a key step towards developing a comprehensive foundation model by exploring the bi-directional mapping of both pMHCs to their corresponding TCRs, and vice versa. While validation performance was significantly worse in the TCR to pMHC direction given the highly asymmetric distribution of pMHC data, we find that the bidirectionally trained model outperformed the model trained in a single pMHC to TCR direction, at the cost of diversity. We work through a rigorous evaluation using well characterized pMHCs and present our framework and findings as a potential direction towards a unified generative foundation model of TCR:pMHC cross-reactivity.

Cite

Text

Karthikeyan and Rubinsteyn. "Multi-Task Training Increases Native Sequence Recovery of Antigen-Specific T-Cell Receptor Sequences." ICML 2024 Workshops: AccMLBio, 2024.

Markdown

[Karthikeyan and Rubinsteyn. "Multi-Task Training Increases Native Sequence Recovery of Antigen-Specific T-Cell Receptor Sequences." ICML 2024 Workshops: AccMLBio, 2024.](https://mlanthology.org/icmlw/2024/karthikeyan2024icmlw-multitask/)

BibTeX

@inproceedings{karthikeyan2024icmlw-multitask,
  title     = {{Multi-Task Training Increases Native Sequence Recovery of Antigen-Specific T-Cell Receptor Sequences}},
  author    = {Karthikeyan, Dhuvarakesh and Rubinsteyn, Alex},
  booktitle = {ICML 2024 Workshops: AccMLBio},
  year      = {2024},
  url       = {https://mlanthology.org/icmlw/2024/karthikeyan2024icmlw-multitask/}
}