FlexRibbon: Joint Sequence and Structure Pretraining for Protein Modeling

Abstract

Protein foundation models have advanced rapidly, with most approaches falling into two dominant paradigms. Sequence-based language models (e.g., ESM-2) capture sequence semantics at scale, and a number of recent works incorporate structural signals into sequence encoders. MSA-based predictors (e.g., AlphaFold 2/3) achieve accurate folding by exploiting evolutionary couplings, but their reliance on homologous sequences makes them less reliable in highly mutated or alignment-sparse regimes. We present FlexRibbon, a pretrained protein model that jointly learns from amino acid sequences and three-dimensional structures. Our pretraining strategy combines masked language modeling with diffusion-based denoising, enabling bidirectional sequence-structure learning without requiring MSAs. Trained on both experimentally resolved structures and AlphaFold 2 predictions, FlexRibbon captures global folds as well as flexible conformations critical for biological function. Evaluated across diverse tasks spanning interface design, intermolecular interaction prediction, and protein function prediction, FlexRibbon establishes new state-of-the-art performance on 12 different tasks, with particularly strong gains in mutation-rich settings where MSA-based methods often struggle.

Cite

Text

Zhu et al. "FlexRibbon: Joint Sequence and Structure Pretraining for Protein Modeling." International Conference on Learning Representations, 2026.

Markdown

[Zhu et al. "FlexRibbon: Joint Sequence and Structure Pretraining for Protein Modeling." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/zhu2026iclr-flexribbon/)

BibTeX

@inproceedings{zhu2026iclr-flexribbon,
  title     = {{FlexRibbon: Joint Sequence and Structure Pretraining for Protein Modeling}},
  author    = {Zhu, Jianwei and Shi, Yu and Bi, Ran and Jin, Peiran and Liu, Chang and Zhang, Zhe and Huang, Haitao and Guo, Zekun and Hu, Pipi and Ju, Fusong and Huang, Lin and Tai, Xinwei and Li, Chenao and Gao, Kaiyuan and Wei, Xinran and Xia, Huanhuan and Zhang, Jia and Min, Yaosen and Wang, Zun and Wang, Yusong and He, Liang and Liu, Haiguang and Qin, Tao},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/zhu2026iclr-flexribbon/}
}