Frayed RoPE and Long Inputs: A Geometric Perspective

Abstract

Rotary Positional Embedding (RoPE) is a widely adopted technique for encoding position in language models, which, while effective, causes performance breakdown when input length exceeds training length. Prior analyses assert (rightly) that long inputs cause channels to rotate “out of distribution,” but it is not clear how extra rotation relates to or causes pathological behavior. Through empirical and theoretical analysis we advance a unified geometric understanding of attention behavior with RoPE. We find that attention induces tight clustering of separated key and query latent point clouds, allowing for creation of sink tokens: placeholders that allow attention heads to avoid token mixing when not required. RoPE applied to longer inputs damages this key/query cluster separation, producing pathological behavior by inhibiting sink token functionality. From this geometric perspective, we propose RoPE-ID (In Distribution), a straightforward modification that allows attention layers to generalize to longer inputs out of the box: apply RoPE with high frequency to a subset of channels. We demonstrate the effectiveness of RoPE-ID for extended inputs using 1B and 3B parameter Transformers on the LongBench and RULER information retrieval benchmarks.

Cite

Text

Wertheimer et al. "Frayed RoPE and Long Inputs: A Geometric Perspective." International Conference on Learning Representations, 2026.

Markdown

[Wertheimer et al. "Frayed RoPE and Long Inputs: A Geometric Perspective." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/wertheimer2026iclr-frayed/)

BibTeX

@inproceedings{wertheimer2026iclr-frayed,
  title     = {{Frayed RoPE and Long Inputs: A Geometric Perspective}},
  author    = {Wertheimer, Davis and Zhang, Aozhong and Liu, Derrick and Yin, Penghang and Wang, Naigang},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/wertheimer2026iclr-frayed/}
}