Alternate Geometric and Semantic Denoising Diffusion for Protein Inverse Folding
Abstract
Protein inverse folding is a fundamental problem in bioinformatics, aiming to recover the amino acid sequences from a given protein backbone structure. Despite the success of existing methods, they still have two limitations: (1) widely used topological modeling via GNNs may not effectively integrate geometric context of the entire protein 3D structure by focusing on only local residue message passing, and (2) current denoising processes primarily rely on geometric relations to update residue representations, while neglecting the semantic and functional correlations between different amino acid types. In this work, we propose an Alternate Geometric and Semantic Denoising Diffusion ( $\mathop {\mathrm {{\textbf {AGSDD}}}}\limits $ AGSDD ) that performs two types of denoising, i.e., geometric denoising and semantic denoising in turn, in the joint Geo-semantic residue representation space: (1) the geometric denoising module uses a geometric contextual aggregator to encode global contextual information from the entire protein structure and selectively distributes information to each residue; and (2) the semantic denoising module uses a learnable key-value dictionary of residue-types to facilitate communication between them so that learned residue features can be more accurately aligned to proper residue types. In experiments, we conduct extensive evaluations on the CATH4.2, TS50 and TS500 datasets, and observe that even without using any pre-trained protein language models, $\mathop {\mathrm {{\textbf {AGSDD}}}}\limits $ AGSDD still outperforms leading methods, achieving state-of-the-art performance and exhibiting strong generalization capabilities.
Cite
Text
Wang et al. "Alternate Geometric and Semantic Denoising Diffusion for Protein Inverse Folding." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025. doi:10.1007/978-3-032-06066-2_21Markdown
[Wang et al. "Alternate Geometric and Semantic Denoising Diffusion for Protein Inverse Folding." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025.](https://mlanthology.org/ecmlpkdd/2025/wang2025ecmlpkdd-alternate/) doi:10.1007/978-3-032-06066-2_21BibTeX
@inproceedings{wang2025ecmlpkdd-alternate,
title = {{Alternate Geometric and Semantic Denoising Diffusion for Protein Inverse Folding}},
author = {Wang, Chenglin and Zhou, Yucheng and Wang, Zhe and Zhai, Zijie and Shen, Jianbing and Zhang, Kai},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2025},
pages = {350-366},
doi = {10.1007/978-3-032-06066-2_21},
url = {https://mlanthology.org/ecmlpkdd/2025/wang2025ecmlpkdd-alternate/}
}