Advancing Retrosynthesis with Retrieval-Augmented Graph Generation

Abstract

Diffusion-based molecular graph generative models have achieved significant success in template-free, single-step retrosynthesis prediction. However, these models typically generate reactants from scratch, often overlooking the fact that the scaffold of a product molecule typically remains unchanged during chemical reactions. To leverage this useful observation, we introduce a retrieval-augmented molecular graph generation framework. Our framework comprises three key components: a retrieval component that identifies similar molecules for the given product, an integration component that learns valuable clues from these molecules about which part of the product should remain unchanged, and a base generative model that is prompted by these clues to generate the corresponding reactants. We explore various design choices for critical and under-explored aspects of this framework and instantiate it as the Retrieval-Augmented RetroBridge (RARB). RARB demonstrates state-of-the-art performance on standard benchmarks, achieving a 14.8% relative improvement in top-1 accuracy over its base generative model, highlighting the effectiveness of retrieval augmentation. Additionally, RARB excels in handling out-of-distribution molecules, and its advantages remain significant even with smaller models or fewer denoising steps. These strengths make RARB highly valuable for real-world retrosynthesis applications, where extrapolation to novel molecules and high-throughput prediction are essential.

Cite

Text

Qiao et al. "Advancing Retrosynthesis with Retrieval-Augmented Graph Generation." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I19.34203

Markdown

[Qiao et al. "Advancing Retrosynthesis with Retrieval-Augmented Graph Generation." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/qiao2025aaai-advancing/) doi:10.1609/AAAI.V39I19.34203

BibTeX

@inproceedings{qiao2025aaai-advancing,
  title     = {{Advancing Retrosynthesis with Retrieval-Augmented Graph Generation}},
  author    = {Qiao, Anjie and Wang, Zhen and Rao, Jiahua and Yang, Yuedong and Wei, Zhewei},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {20004-20013},
  doi       = {10.1609/AAAI.V39I19.34203},
  url       = {https://mlanthology.org/aaai/2025/qiao2025aaai-advancing/}
}