SIRD: Symbolic Integration Rules Dataset
Abstract
Advancements in neural networks and computer hardware lead to new use cases for deep learning in the natural sciences every day. Even though symbolic mathematics tasks have been explored, symbolic integration only has a few studies using black box models and currently lacks explainability. Symbolic integration is a challenging search problem and the final result is obtained by applying different integration rules at each step. We propose a novel and interpretable approach to perform symbolic integration using deep learning through integral rule prediction to speed up the search. We introduce the first-of-its-kind symbolic integration rules dataset comprising two million distinct functions and integration rule pairs. For complex rules such as u-substitution and integration by parts, it also includes the expression needed for rule application. We also train a transformer model on our proposed dataset and incorporate it into SymPy's integral\_steps function to get guided\_integral\_steps, resulting in $6\times$ fewer branches explored by allowing our model to guide the depth-first-search procedure.
Cite
Text
Sharma et al. "SIRD: Symbolic Integration Rules Dataset." NeurIPS 2023 Workshops: MATH-AI, 2023.Markdown
[Sharma et al. "SIRD: Symbolic Integration Rules Dataset." NeurIPS 2023 Workshops: MATH-AI, 2023.](https://mlanthology.org/neuripsw/2023/sharma2023neuripsw-sird/)BibTeX
@inproceedings{sharma2023neuripsw-sird,
title = {{SIRD: Symbolic Integration Rules Dataset}},
author = {Sharma, Vaibhav and Nagpal, Abhinav and Balin, Muhammed Fatih},
booktitle = {NeurIPS 2023 Workshops: MATH-AI},
year = {2023},
url = {https://mlanthology.org/neuripsw/2023/sharma2023neuripsw-sird/}
}