Synergy of GFlowNet and Protein Language Model Makes a Diverse Antibody Designer

Abstract

Antibodies defend our health by binding to antigens with high specificity and potentiality, primarily relying on the Complementarity-Determining Region (CDR). Yet, current experimental methods of discovering new antibody CDRs are heavily time-consuming. Computational design could alleviate this burden; especially, protein language models have proven quite beneficial in many recent studies. However, most existing models solely focus on antibody potentiality and struggle to encapsulate the diverse range of plausible CDR candidates, limiting their effectiveness in real-world scenarios as binding is only one factor in the multitude of drug-forming criteria. In this paper, we introduce PG-AbD, a framework uniting Generative Flow Networks (GFlowNets) and pretrained Protein Language Models (PLMs) to successfully generate highly potent, diverse and novel antibody candidates. We innovatively construct a Products of Experts (PoE) composed by the global-distribution-modeling PLM and the local-distribution-modeling Potts Model to serve as the reward function of GFlowNet. The joint training paradigm is introduced, where PoE is trained by contrastive divergence with the negative samples generated by GFlowNet, and then guides GFlowNet to sample diverse antibody candidates. We evaluate PG-AbD on extensive antibody design benchmarks. It significantly outperforms existing methods in diversity (13.5% on RabDab, 31.1% on SabDab) while maintaining optimal potential and novelty. Generated antibodies are also found to form stable, regular 3D structures with their corresponding antigens, demonstrating the great potential of PG-AbD to accelerate real-world antibody discovery.

Cite

Text

Yin et al. "Synergy of GFlowNet and Protein Language Model Makes a Diverse Antibody Designer." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I21.34370

Markdown

[Yin et al. "Synergy of GFlowNet and Protein Language Model Makes a Diverse Antibody Designer." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/yin2025aaai-synergy/) doi:10.1609/AAAI.V39I21.34370

BibTeX

@inproceedings{yin2025aaai-synergy,
  title     = {{Synergy of GFlowNet and Protein Language Model Makes a Diverse Antibody Designer}},
  author    = {Yin, Mingze and Zhou, Hanjing and Zhu, Yiheng and Wu, Jialu and Wu, Wei and Li, Mingyang and Fu, Kun and Wang, Zheng and Hsieh, Chang-Yu and Hou, Tingjun and Wu, Jian},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {22164-22172},
  doi       = {10.1609/AAAI.V39I21.34370},
  url       = {https://mlanthology.org/aaai/2025/yin2025aaai-synergy/}
}