Procedural Synthesis of Synthesizable Molecules
Abstract
Designing synthetically accessible molecules and recommending analogs to unsynthesizable molecules are important problems for accelerating molecular discovery. We reconceptualize both problems using ideas from program synthesis. Drawing inspiration from syntax-guided synthesis approaches, we decouple the syntactic skeleton from the semantics of a synthetic tree to create a bilevel framework for reasoning about the combinatorial space of synthesis pathways. Given a molecule we aim to generate analogs for, we iteratively refine its skeletal characteristics via Markov chain Monte Carlo simulations over the space of syntactic skeletons. Given a black-box oracle to optimize, we formulate a joint design space over syntactic templates and molecular descriptors and introduce evolutionary algorithms that optimize both syntactic and semantic dimensions synergistically. Our key insight is that once the syntactic skeleton is set, we can amortize over the search complexity of deriving the program's semantics by training policies to fully utilize the fixed-horizon Markov decision process imposed by the syntactic template. We demonstrate performance advantages of our bilevel framework for synthesizable analog generation and synthesizable molecule design. Notably, our approach offers the user explicit control over the resources required to perform synthesis and biases the design space towards simpler solutions, making it particularly promising for autonomous synthesis platforms. Supporting code is at https://github.com/shiningsunnyday/SynthesisNet.
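To make the idea of Markov chain Monte Carlo over syntactic skeletons concrete, the following is a minimal Python sketch of a Metropolis sampler on a small discrete set of skeletons. It is not the authors' implementation: the skeleton encoding (a tuple of reactant counts per reaction step), the function `metropolis_over_skeletons`, and the scoring function `toy_score` are all hypothetical and chosen only for illustration. In the paper's setting, the score would presumably reflect how well the best molecule realizable under a skeleton matches the target.

```python
import math
import random


def metropolis_over_skeletons(skeletons, score, steps=1000, temperature=1.0, seed=0):
    """Metropolis sampling over a finite set of skeletons (illustrative only).

    The proposal draws uniformly from all skeletons other than the current
    one, so it is symmetric and the plain Metropolis acceptance rule applies.
    Returns the highest-scoring skeleton visited by the chain.
    """
    rng = random.Random(seed)
    current = rng.choice(skeletons)
    best = current
    for _ in range(steps):
        proposal = rng.choice([s for s in skeletons if s != current])
        delta = score(proposal) - score(current)
        # Always accept improvements; accept worse moves with prob exp(delta / T).
        if delta >= 0 or rng.random() < math.exp(delta / temperature):
            current = proposal
        if score(current) > score(best):
            best = current
    return best


# Hypothetical encoding: one entry per reaction step,
# 1 = unimolecular step, 2 = bimolecular step.
SKELETONS = [(1,), (2,), (2, 1), (2, 2), (2, 1, 2)]


def toy_score(skeleton):
    # Hypothetical score: prefer two-step routes, lightly penalize
    # the total number of reactants (a stand-in for synthesis cost).
    return -abs(len(skeleton) - 2) - 0.1 * sum(skeleton)


best = metropolis_over_skeletons(SKELETONS, toy_score, steps=200)
print(best)
```

The penalty on reactant count in `toy_score` mirrors, in a very loose way, the abstract's point that the search can be biased towards simpler solutions; any real scoring function would need to be defined over actual synthetic trees.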
Cite
Text
Sun et al. "Procedural Synthesis of Synthesizable Molecules." International Conference on Learning Representations, 2025.
Markdown
[Sun et al. "Procedural Synthesis of Synthesizable Molecules." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/sun2025iclr-procedural/)
BibTeX
@inproceedings{sun2025iclr-procedural,
title = {{Procedural Synthesis of Synthesizable Molecules}},
author = {Sun, Michael and Lo, Alston and Guo, Minghao and Chen, Jie and Coley, Connor W. and Matusik, Wojciech},
booktitle = {International Conference on Learning Representations},
year = {2025},
url = {https://mlanthology.org/iclr/2025/sun2025iclr-procedural/}
}