A Variational Perspective on Generative Protein Fitness Optimization
Abstract
The goal of protein fitness optimization is to discover new protein variants with enhanced fitness for a given use. The vast search space and the sparsely populated fitness landscape, along with the discrete nature of protein sequences, pose significant challenges when trying to determine the gradient towards configurations with higher fitness. We introduce Variational Latent Generative Protein Optimization (VLGPO), a variational perspective on fitness optimization. Our method embeds protein sequences in a continuous latent space to enable efficient sampling from the fitness distribution and combines a (learned) flow matching prior over sequence mutations with a fitness predictor to guide optimization towards sequences with high fitness. VLGPO achieves state-of-the-art results on two different protein benchmarks of varying complexity. Moreover, the variational design with explicit prior and likelihood functions offers a flexible plug-and-play framework that can be easily customized to suit various protein design tasks.
Cite
Text
Bogensperger et al. "A Variational Perspective on Generative Protein Fitness Optimization." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Bogensperger et al. "A Variational Perspective on Generative Protein Fitness Optimization." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/bogensperger2025icml-variational/)BibTeX
@inproceedings{bogensperger2025icml-variational,
title = {{A Variational Perspective on Generative Protein Fitness Optimization}},
author = {Bogensperger, Lea and Narnhofer, Dominik and Allam, Ahmed and Schindler, Konrad and Krauthammer, Michael},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {4700-4712},
volume = {267},
url = {https://mlanthology.org/icml/2025/bogensperger2025icml-variational/}
}