Repeated Examples Help Learn Arithmetic

Abstract

We study small transformers trained on two problems of arithmetic: the greatest common divisor (GCD) and modular multiplication, and show that models trained on a limited set of repeated examples achieve better performance than models trained from unlimited data. In fact, modular multiplication is only learned on small training sets. We also demonstrate that two-set training - repeated use of a small random subset of examples, along normal sampling on the rest of the training set - provides for faster learning and better performance. These experiments highlight that the benefits of repetition can outweigh those of data diversity; and shed light on the still poorly understood interplay between generalization and memorization in deep learning.

Cite

Text

Charton and Kempe. "Repeated Examples Help Learn Arithmetic." NeurIPS 2024 Workshops: MATH-AI, 2024.

Markdown

[Charton and Kempe. "Repeated Examples Help Learn Arithmetic." NeurIPS 2024 Workshops: MATH-AI, 2024.](https://mlanthology.org/neuripsw/2024/charton2024neuripsw-repeated/)

BibTeX

@inproceedings{charton2024neuripsw-repeated,
  title     = {{Repeated Examples Help Learn Arithmetic}},
  author    = {Charton, Francois and Kempe, Julia},
  booktitle = {NeurIPS 2024 Workshops: MATH-AI},
  year      = {2024},
  url       = {https://mlanthology.org/neuripsw/2024/charton2024neuripsw-repeated/}
}