Learning Efficient Recursive Numeral Systems via Reinforcement Learning
Abstract
The emergence of mathematical concepts, such as number systems, is an understudied area in AI for mathematics and reasoning. It has previously been shown (Carlsson et al., 2021) that by using reinforcement learning (RL), agents can derive simple approximate and exact-restricted numeral systems. However, it is a major challenge to show how more complex recursive numeral systems, similar to the one utilised in English, could arise via a simple learning mechanism such as RL. Here, we introduce an approach towards deriving a mechanistic explanation of the emergence of recursive number systems where we consider an RL agent which directly optimizes a lexicon under a given meta-grammar. Utilising a slightly modified version of the seminal meta-grammar of (Hurford, 1975), we demonstrate that our RL agent can effectively modify the lexicon towards Pareto-optimal configurations which are comparable to those observed within human numeral systems.
Cite
Text
Thomas et al. "Learning Efficient Recursive Numeral Systems via Reinforcement Learning." ICML 2024 Workshops: AI4MATH, 2024.Markdown
[Thomas et al. "Learning Efficient Recursive Numeral Systems via Reinforcement Learning." ICML 2024 Workshops: AI4MATH, 2024.](https://mlanthology.org/icmlw/2024/thomas2024icmlw-learning/)BibTeX
@inproceedings{thomas2024icmlw-learning,
title = {{Learning Efficient Recursive Numeral Systems via Reinforcement Learning}},
author = {Thomas, Jonathan David and Silvi, Andrea and Dubhashi, Devdatt and Carlsson, Emil and Johansson, Moa},
booktitle = {ICML 2024 Workshops: AI4MATH},
year = {2024},
url = {https://mlanthology.org/icmlw/2024/thomas2024icmlw-learning/}
}