Transformers Learn Low Sensitivity Functions: Investigations and Implications
Abstract
Transformers achieve state-of-the-art accuracy and robustness across many tasks, but an understanding of their inductive biases and how those biases differ from other neural network architectures remains elusive. In this work, we identify the sensitivity of the model to token-wise random perturbations in the input as a unified metric which explains the inductive bias of transformers across different data modalities and distinguishes them from other architectures. We show that transformers have lower sensitivity than MLPs, CNNs, ConvMixers and LSTMs, across both vision and language tasks. We also show that this low-sensitivity bias has important implications: i) lower sensitivity correlates with improved robustness; it can also be used as an efficient intervention to further improve the robustness of transformers; ii) it corresponds to flatter minima in the loss landscape; and iii) it can serve as a progress measure for grokking. We support these findings with theoretical results showing (weak) spectral bias of transformers in the NTK regime, and improved robustness due to the lower sensitivity.
Cite
Text
Vasudeva et al. "Transformers Learn Low Sensitivity Functions: Investigations and Implications." International Conference on Learning Representations, 2025.Markdown
[Vasudeva et al. "Transformers Learn Low Sensitivity Functions: Investigations and Implications." International Conference on Learning Representations, 2025.](https://mlanthology.org/iclr/2025/vasudeva2025iclr-transformers/)BibTeX
@inproceedings{vasudeva2025iclr-transformers,
title = {{Transformers Learn Low Sensitivity Functions: Investigations and Implications}},
author = {Vasudeva, Bhavya and Fu, Deqing and Zhou, Tianyi and Kau, Elliott and Huang, Youqi and Sharan, Vatsal},
booktitle = {International Conference on Learning Representations},
year = {2025},
url = {https://mlanthology.org/iclr/2025/vasudeva2025iclr-transformers/}
}