Log-Linear Attention

Guo, Han; Yang, Songlin; Goel, Tarushii; Xing, Eric P.; Dao, Tri; Kim, Yoon

Log-Linear Attention

Han Guo, Songlin Yang, Tarushii Goel, Eric P. Xing, Tri Dao, Yoon Kim

ICLR 2026

/iclr/2026/guo2026iclr-loglinear/

Abstract

The attention mechanism in Transformers is an important primitive for accurate and scalable sequence modeling. Its quadratic-compute and linear-memory complexity however remain significant bottlenecks. Linear attention and state-space models enable linear-time, constant-memory sequence modeling and can moreover be trained efficiently through matmul-rich parallelization across sequence length. However, at their core these models are still RNNs, and thus their use of a fixed-size hidden state to model the context is a fundamental limitation. This paper develops log-linear attention, an attention mechanism that balances linear attention's efficiency and the expressiveness of softmax attention. Log-linear attention replaces the fixed-size hidden state with a logarithmically growing set of hidden states. We show that with a particular growth function, log-linear attention admits a similarly matmul-rich parallel form whose compute cost is log-linear in sequence length. Log-linear attention is a general framework and can be applied on top of existing linear attention variants. As case studies, we instantiate log-linear variants of two recent architectures---Mamba-2 and Gated DeltaNet---and find they perform well compared to their linear-time variants.

PDF ICLR OpenReview Semantic Scholar

Cite

Text

Guo et al. "Log-Linear Attention." International Conference on Learning Representations, 2026.

Markdown

[Guo et al. "Log-Linear Attention." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/guo2026iclr-loglinear/)

BibTeX

@inproceedings{guo2026iclr-loglinear,
  title     = {{Log-Linear Attention}},
  author    = {Guo, Han and Yang, Songlin and Goel, Tarushii and Xing, Eric P. and Dao, Tri and Kim, Yoon},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/guo2026iclr-loglinear/}
}