Integrating Protein Language Model and Active Learning for Few-Shot Viral Variant Detection
Abstract
Early detection of high-fitness SARS-CoV-2 variants is crucial for pandemic response, yet limited experimental resources hinder timely identification. We propose an active learning framework that integrates a protein language model, a Gaussian process with uncertainty estimation, and a biophysical model to predict the fitness of novel receptor-binding domain (RBD) variants in a few-shot learning setting. Our approach prioritizes the most informative variants for experimental characterization, accelerating high-fitness variant detection by up to 5× compared to random sampling while testing fewer than 1\% of all possible variants. Benchmarking on deep mutational scans, we show that our method identifies evolutionarily significant sites, particularly those facilitating antibody escape. We systematically compare different acquisition strategies and demonstrate that incorporating uncertainty-driven exploration enhances coverage of the mutational landscape, enabling the discovery of evolutionarily distant yet high-risk variants. Our results suggest that this framework could serve as an efficient early warning system for identifying concerning SARS-CoV-2 variants before they achieve widespread circulation.
Cite
Text
Huot et al. "Integrating Protein Language Model and Active Learning for Few-Shot Viral Variant Detection." ICLR 2025 Workshops: LMRL, 2025.Markdown
[Huot et al. "Integrating Protein Language Model and Active Learning for Few-Shot Viral Variant Detection." ICLR 2025 Workshops: LMRL, 2025.](https://mlanthology.org/iclrw/2025/huot2025iclrw-integrating/)BibTeX
@inproceedings{huot2025iclrw-integrating,
title = {{Integrating Protein Language Model and Active Learning for Few-Shot Viral Variant Detection}},
author = {Huot, Marian and Wang, Dianzhuo and Liu, Jiacheng and Shakhnovich, Eugene},
booktitle = {ICLR 2025 Workshops: LMRL},
year = {2025},
url = {https://mlanthology.org/iclrw/2025/huot2025iclrw-integrating/}
}