Sidechain Conditioning and Modeling for Full-Atom Protein Sequence Design with FAMPNN

Abstract

Leading deep learning-based methods for fixed-backbone protein sequence design do not model protein sidechain conformation during sequence generation despite the large role the three-dimensional arrangement of sidechain atoms play in protein conformation, stability, and overall protein function. Instead, these models implicitly reason about crucial sidechain interactions based solely on backbone geometry and amino-acid sequence. To address this, we present FAMPNN (Full-Atom MPNN), a sequence design method that explicitly models both sequence identity and sidechain conformation for each residue, where the per-token distribution of a residue’s discrete amino acid identity and its continuous sidechain conformation are learned with a combined categorical cross-entropy and diffusion loss objective. We demonstrate learning these distributions jointly is a highly synergistic task that both improves sequence recovery while achieving state-of-the-art sidechain packing. Furthermore, benefits from explicit full-atom modeling generalize from sequence recovery to practical protein design applications, such as zero-shot prediction of experimental binding and stability measurements.

Cite

Text

Widatalla et al. "Sidechain Conditioning and Modeling for Full-Atom Protein Sequence Design with FAMPNN." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Widatalla et al. "Sidechain Conditioning and Modeling for Full-Atom Protein Sequence Design with FAMPNN." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/widatalla2025icml-sidechain/)

BibTeX

@inproceedings{widatalla2025icml-sidechain,
  title     = {{Sidechain Conditioning and Modeling for Full-Atom Protein Sequence Design with FAMPNN}},
  author    = {Widatalla, Talal and Shuai, Richard W. and Hie, Brian and Huang, Possu},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {66746-66771},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/widatalla2025icml-sidechain/}
}