Steering Protein Family Design Through Profile Bayesian Flow

Abstract

Protein family design emerges as a promising alternative by combining the advantages of de novo protein design and mutation-based directed evolution.In this paper, we propose ProfileBFN, the Profile Bayesian Flow Networks, for specifically generative modeling of protein families. ProfileBFN extends the discrete Bayesian Flow Network from an MSA profile perspective, which can be trained on single protein sequences by regarding it as a degenerate profile, thereby achieving efficient protein family design by avoiding large-scale MSA data construction and training. Empirical results show that ProfileBFN has a profound understanding of proteins. When generating diverse and novel family proteins, it can accurately capture the structural characteristics of the family. The enzyme produced by this method is more likely than the previous approach to have the corresponding function, offering better odds of generating diverse proteins with the desired functionality.

Cite

Text

Gong et al. "Steering Protein Family Design Through Profile Bayesian Flow." International Conference on Learning Representations, 2025.

Markdown

[Gong et al. "Steering Protein Family Design Through Profile Bayesian Flow." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/gong2025iclr-steering/)

BibTeX

@inproceedings{gong2025iclr-steering,
  title     = {{Steering Protein Family Design Through Profile Bayesian Flow}},
  author    = {Gong, Jingjing and Pei, Yu and Long, Siyu and Song, Yuxuan and Zhang, Zhe and Huang, Wenhao and Cao, Ziyao and Zhang, Shuyi and Zhou, Hao and Ma, Wei-Ying},
  booktitle = {International Conference on Learning Representations},
  year      = {2025},
  url       = {https://mlanthology.org/iclr/2025/gong2025iclr-steering/}
}