On the Geometry and Topology of Representations: The Manifolds of Modular Addition

Abstract

The Clock and Pizza interpretations, associated with architectures differing in either uniform or learnable attention, were introduced to argue that different architectural designs can yield distinct circuits for modular addition. In this work, we show that this is not the case, and that both the uniform and trainable attention architectures implement the same algorithm via topologically and geometrically equivalent representations. Our methodology goes beyond the interpretation of individual neurons and weights. Instead, we identify all of the neurons corresponding to each learned representation and then study the collective group of neurons as one entity. This method reveals that each learned representation is a manifold that we can study utilizing tools from topology. Based on this insight, we can statistically analyze the learned representations across hundreds of circuits to demonstrate the similarity between learned modular addition circuits that arise naturally from common deep learning paradigms.

Cite

Text

Moisescu-Pareja et al. "On the Geometry and Topology of Representations: The Manifolds of Modular Addition." International Conference on Learning Representations, 2026.

Markdown

[Moisescu-Pareja et al. "On the Geometry and Topology of Representations: The Manifolds of Modular Addition." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/moisescupareja2026iclr-geometry/)

BibTeX

@inproceedings{moisescupareja2026iclr-geometry,
  title     = {{On the Geometry and Topology of Representations: The Manifolds of Modular Addition}},
  author    = {Moisescu-Pareja, Gabriela and McCracken, Gavin and Wiltzer, Harley and Daniels, Colin and Létourneau, Vincent and Precup, Doina and Love, Jonathan},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/moisescupareja2026iclr-geometry/}
}