Interactive Learning of Single-Index Models via Stochastic Gradient Descent

Abstract

Stochastic gradient descent (SGD) is a cornerstone algorithm for high-dimensional optimization, renowned for its empirical successes. Recent theoretical advances have provided a deep understanding of how SGD enables feature learning in high-dimensional nonlinear models, most notably the *single-index model* with i.i.d. data. In this work, we study the sequential learning problem for single-index models, also known as generalized linear bandits or ridge bandits, where SGD is a simple and natural solution, yet its learning dynamics remain largely unexplored. We show that, similar to the optimal interactive learner, SGD undergoes a distinct "burn-in" phase before entering the "learning" phase in this setting. Moreover, with an appropriately chosen learning rate schedule, a single SGD procedure simultaneously achieves near-optimal (or best-known) sample complexity and regret guarantees across both phases, for a broad class of link functions. Our results demonstrate that SGD remains highly competitive for learning single-index models under adaptive data.

Cite

Text

Rajaraman and Han. "Interactive Learning of Single-Index Models via Stochastic Gradient Descent." International Conference on Learning Representations, 2026.

Markdown

[Rajaraman and Han. "Interactive Learning of Single-Index Models via Stochastic Gradient Descent." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/rajaraman2026iclr-interactive/)

BibTeX

@inproceedings{rajaraman2026iclr-interactive,
  title     = {{Interactive Learning of Single-Index Models via Stochastic Gradient Descent}},
  author    = {Rajaraman, Nived and Han, Yanjun},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/rajaraman2026iclr-interactive/}
}