Reaction Prediction via Interaction Modeling of Symmetric Difference Shingle Sets
Abstract
Chemical reaction prediction remains a fundamental challenge in organic chemistry, where existing machine learning models face two critical limitations: sensitivity to input permutations (molecule/atom orderings) and inadequate modeling of substructural interactions governing reactivity. These shortcomings lead to inconsistent predictions and poor generalization to real-world scenarios. To address these challenges, we propose ReaDISH, a novel reaction prediction model that learns permutation-invariant representations while incorporating interaction-aware features. It introduces two innovations: (1) symmetric difference shingle encoding, which extends the differential reaction fingerprint (DRFP) by representing shingles as continuous high-dimensional embeddings, capturing structural changes while eliminating order sensitivity; and (2) geometry-structure interaction attention, a mechanism that models intra- and inter-molecular interactions at the shingle level. Extensive experiments demonstrate that ReaDISH improves reaction prediction performance across diverse benchmarks. It shows enhanced robustness with an average improvement of 8.76\% on R$^2$ under permutation perturbations.
Cite
Text
Shi et al. "Reaction Prediction via Interaction Modeling of Symmetric Difference Shingle Sets." Advances in Neural Information Processing Systems, 2025.Markdown
[Shi et al. "Reaction Prediction via Interaction Modeling of Symmetric Difference Shingle Sets." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/shi2025neurips-reaction/)BibTeX
@inproceedings{shi2025neurips-reaction,
title = {{Reaction Prediction via Interaction Modeling of Symmetric Difference Shingle Sets}},
author = {Shi, Runhan and Chen, Letian and Yu, Gufeng and Yang, Yang},
booktitle = {Advances in Neural Information Processing Systems},
year = {2025},
url = {https://mlanthology.org/neurips/2025/shi2025neurips-reaction/}
}