Graph-Based Symbolic Regression with Invariance and Constraint Encoding

Abstract

Symbolic regression (SR) seeks interpretable analytical expressions that uncover the governing relationships within data, providing mechanistic insight beyond 'black-box' models. However, existing SR methods often suffer from two key limitations: (1) *redundant representations* that fail to capture mathematical equivalences and higher-order operand relations, breaking permutation invariance and hindering efficient learning; and (2) *sparse rewards* caused by incomplete incorporation of constraints that can only be evaluated on full expressions, such as constant fitting or physical-law verification. To address these challenges, we propose a unified framework, **Graph-based Symbolic Regression (GSR)**, which compresses the search space through the permutation-invariant representations, expression graphs (EGs), that intrinsically encode expression equivalences via a term-rewriting system (TRS) and a directed acyclic graph (DAG) structure. GSR mitigates reward sparsity by employing a hybrid neural-guided Monte Carlo tree search (hnMCTS) on EGs, where constraint-informed neural guidance enables the direct incorporation of expression-level constraint priors, and an adaptive $\epsilon$-UCB policy balances exploration and exploitation. Theoretical analyses establish the uniqueness of our proposed EG representation and the convergence of the hnMCTS algorithm. Experiments on synthetic and real-world scientific datasets demonstrate the efficiency and accuracy of GSR in discovering underlying expressions and adhering to physical laws, offering practical solutions for scientific discovery.

Cite

Text

Xiang et al. "Graph-Based Symbolic Regression with Invariance and Constraint Encoding." Advances in Neural Information Processing Systems, 2025.

Markdown

[Xiang et al. "Graph-Based Symbolic Regression with Invariance and Constraint Encoding." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/xiang2025neurips-graphbased/)

BibTeX

@inproceedings{xiang2025neurips-graphbased,
  title     = {{Graph-Based Symbolic Regression with Invariance and Constraint Encoding}},
  author    = {Xiang, Ziyu and Ashen, Kenna and Qian, Xiaofeng and Qian, Xiaoning},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/xiang2025neurips-graphbased/}
}