A Theoretically-Principled Sparse, Connected, and Rigid Graph Representation of Molecules

Abstract

Graph neural networks (GNNs) -- learn graph representations by exploiting the graph's sparsity, connectivity, and symmetries -- have become indispensable for learning geometric data like molecules. However, the most used graphs (e.g., radial cutoff graphs) in molecular modeling lack theoretical guarantees for achieving connectivity and sparsity simultaneously, which are essential for the performance and scalability of GNNs. Furthermore, existing widely used graph construction methods for molecules lack rigidity, limiting GNNs' ability to exploit graph nodes' spatial arrangement. In this paper, we introduce a new hyperparameter-free graph construction of molecules and beyond with sparsity, connectivity, and rigidity guarantees. Remarkably, our method consistently generates connected and sparse graphs with the edge-to-node ratio being bounded above by 3. Our graphs' rigidity guarantees that edge distances and dihedral angles are sufficient to uniquely determine the general spatial arrangements of atoms. We substantiate the effectiveness and efficiency of our proposed graphs in various molecular modeling benchmarks. Code is available at https://github.com/shihhsinwang0214/SCHull.

Cite

Text

Wang et al. "A Theoretically-Principled Sparse, Connected, and Rigid Graph Representation of Molecules." International Conference on Learning Representations, 2025.

Markdown

[Wang et al. "A Theoretically-Principled Sparse, Connected, and Rigid Graph Representation of Molecules." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/wang2025iclr-theoreticallyprincipled/)

BibTeX

@inproceedings{wang2025iclr-theoreticallyprincipled,
  title     = {{A Theoretically-Principled Sparse, Connected, and Rigid Graph Representation of Molecules}},
  author    = {Wang, Shih-Hsin and Huang, Yuhao and Baker, Justin M. and Sun, Yuan-En and Tang, Qi and Wang, Bao},
  booktitle = {International Conference on Learning Representations},
  year      = {2025},
  url       = {https://mlanthology.org/iclr/2025/wang2025iclr-theoreticallyprincipled/}
}