RankFlow: Property-Aware Transport for Protein Optimization

Abstract

A key step in protein optimization is modeling the fitness landscape, which maps proteins to functional assay readouts. Existing methods typically either use property-agnostic likelihoods/embeddings from pretrained protein language models (PLMs) for fitness prediction, or assume independent mutational effects, limiting their ability to capture higher-order interactions. In this work, we introduce RankFlow, a conditional flow framework that refines PLM representations to be a property-aligned distribution via a tailored energy function and captures multi-mutation interactions through learnable embeddings. To align optimization with evaluation protocols, we propose the Rank-Consistent Conditional Flow Loss (RC$^2$), a differentiable ranking objective that enforces the correct order of mutants rather than absolute values, which improves out-of-distribution generalization. Finally, we introduce a Property-guided Steering Gate (PSG) that concentrates learning on positions carrying signals for the target property while suppressing unrelated evolutionary biases. Across the ProteinGym, PEER, and FLIP benchmarks, RankFlow obtains state-of-the-art ranking accuracy and superior generalization performance.

Cite

Text

Yu et al. "RankFlow: Property-Aware Transport for Protein Optimization." International Conference on Learning Representations, 2026.

Markdown

[Yu et al. "RankFlow: Property-Aware Transport for Protein Optimization." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/yu2026iclr-rankflow/)

BibTeX

@inproceedings{yu2026iclr-rankflow,
  title     = {{RankFlow: Property-Aware Transport for Protein Optimization}},
  author    = {Yu, Lu and Xiang, Wei and Han, Kang and Liu, Gaowen and Kompella, Ramana Rao},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/yu2026iclr-rankflow/}
}