Protein Language Models Expose Viral Mimicry and Immune Escape
Abstract
Viruses elude the immune system through molecular mimicry, adopting their hosts biophysical characteristics. We adapt protein language models (PLMs) to differenti-ate between human and viral proteins. Understanding where the immune system and our models make mistakes could reveal viral immune escape mechanisms. We ap-plied pretrained PLMs to predict viral from human pro-teins. achieving a state-of-the-art results (99.7% ROCAUC). We use interpretable models to characterize viral escapers. Altogether, mistakes account for 3.9% of the sequences with viral proteins being disproportionally misclassified. Errors often involve proteins with low im-munogenic potential, human specific viruses, and reverse transcriptases. Viral families causing chronic infections and immune evasion are further enriched. Biological and ML models make similar mistakes. Integrating PLMs with explainable AI, we provide novel insights into viral im-mune escape mechanisms, enhancing strategies for vac-cine development and antiviral research.
Cite
Text
Ofer and Linial. "Protein Language Models Expose Viral Mimicry and Immune Escape." ICML 2024 Workshops: ML4LMS, 2024.Markdown
[Ofer and Linial. "Protein Language Models Expose Viral Mimicry and Immune Escape." ICML 2024 Workshops: ML4LMS, 2024.](https://mlanthology.org/icmlw/2024/ofer2024icmlw-protein/)BibTeX
@inproceedings{ofer2024icmlw-protein,
title = {{Protein Language Models Expose Viral Mimicry and Immune Escape}},
author = {Ofer, Dan and Linial, Michal},
booktitle = {ICML 2024 Workshops: ML4LMS},
year = {2024},
url = {https://mlanthology.org/icmlw/2024/ofer2024icmlw-protein/}
}