Repeated Examples Help Learn Arithmetic
Abstract
We study small transformers trained on two problems of arithmetic: the greatest common divisor (GCD) and modular multiplication, and show that models trained on a limited set of repeated examples achieve better performance than models trained from unlimited data. In fact, modular multiplication is only learned on small training sets. We also demonstrate that two-set training - repeated use of a small random subset of examples, along normal sampling on the rest of the training set - provides for faster learning and better performance. These experiments highlight that the benefits of repetition can outweigh those of data diversity; and shed light on the still poorly understood interplay between generalization and memorization in deep learning.
Cite
Text
Charton and Kempe. "Repeated Examples Help Learn Arithmetic." NeurIPS 2024 Workshops: MATH-AI, 2024.Markdown
[Charton and Kempe. "Repeated Examples Help Learn Arithmetic." NeurIPS 2024 Workshops: MATH-AI, 2024.](https://mlanthology.org/neuripsw/2024/charton2024neuripsw-repeated/)BibTeX
@inproceedings{charton2024neuripsw-repeated,
title = {{Repeated Examples Help Learn Arithmetic}},
author = {Charton, Francois and Kempe, Julia},
booktitle = {NeurIPS 2024 Workshops: MATH-AI},
year = {2024},
url = {https://mlanthology.org/neuripsw/2024/charton2024neuripsw-repeated/}
}