Repeated Examples Help Learn Arithmetic

Abstract

We study small transformers trained on two problems of arithmetic: the greatest common divisor (GCD) and modular multiplication, and show that models trained on a limited set of repeated examples achieve better performance than models trained from unlimited data. In fact, modular multiplication is only learned on small training sets. We also demonstrate that two-set training - repeated use of a small random subset of examples, along normal sampling on the rest of the training set - provides for faster learning and better performance. These experiments highlight that the benefits of repetition can outweigh those of data diversity; and shed light on the still poorly understood interplay between generalization and memorization in deep learning.

PDF NeurIPSW OpenReview Semantic Scholar

Cite

Text

Charton and Kempe. "Repeated Examples Help Learn Arithmetic." NeurIPS 2024 Workshops: MATH-AI, 2024.

Markdown

[Charton and Kempe. "Repeated Examples Help Learn Arithmetic." NeurIPS 2024 Workshops: MATH-AI, 2024.](https://mlanthology.org/neuripsw/2024/charton2024neuripsw-repeated/)

BibTeX

@inproceedings{charton2024neuripsw-repeated,
  title     = {{Repeated Examples Help Learn Arithmetic}},
  author    = {Charton, Francois and Kempe, Julia},
  booktitle = {NeurIPS 2024 Workshops: MATH-AI},
  year      = {2024},
  url       = {https://mlanthology.org/neuripsw/2024/charton2024neuripsw-repeated/}
}