Boolean Satisfiability via Imitation Learning

Abstract

We propose ImitSAT, a branching policy for conflict-driven clause learning (CDCL) solvers based on imitation learning for the Boolean satisfiability problem (SAT). Unlike previous methods that predict instance-level signals to improve CDCL branching indirectly, or rely on reinforcement learning and insufficient CDCL information to enhance branching, ImitSAT learns from expert KeyTrace that collapses a full run into the sequence of surviving decisions. Replaying a KeyTrace on the same instance is nearly conflict-free, providing dense decision- level supervision and directly reducing propagations—the dominant contributor to wall-clock time. This prefix-conditioned supervision enables ImitSAT to reproduce high-quality branches without exploration, yielding faster convergence, stable training, and seamless integration into CDCL. Extensive experiments demonstrate that ImitSAT reduces propagation counts and runtime, outperforming state-of-the-art learned approaches. We released the source code and trained model at https://github.com/zewei-Zhang/ImitSAT.

Cite

Text

Zhang et al. "Boolean Satisfiability via Imitation Learning." International Conference on Learning Representations, 2026.

Markdown

[Zhang et al. "Boolean Satisfiability via Imitation Learning." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/zhang2026iclr-boolean/)

BibTeX

@inproceedings{zhang2026iclr-boolean,
  title     = {{Boolean Satisfiability via Imitation Learning}},
  author    = {Zhang, Zewei and Liu, Huan and Yu, Yuanhao and Chen, Jun and Xu, Xiangyu},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/zhang2026iclr-boolean/}
}