On the Complexity of First-Order Methods in Stochastic Bilevel Optimization

Abstract

We consider the problem of finding stationary points in Bilevel optimization when the lower-level problem is unconstrained and strongly convex. The problem has been extensively studied in recent years; the main technical challenge is to keep track of lower-level solutions $y^*(x)$ in response to the changes in the upper-level variables $x$. Subsequently, all existing approaches tie their analyses to a genie algorithm that knows lower-level solutions and, therefore, need not query any points far from them. We consider a dual question to such approaches: suppose we have an oracle, which we call $y^*$-aware, that returns an $O(\epsilon)$-estimate of the lower-level solution, in addition to first-order gradient estimators locally unbiased within the $\Theta(\epsilon)$-ball around $y^*(x)$. We study the complexity of finding stationary points with such an $y^*$-aware oracle: we propose a simple first-order method that converges to an $\epsilon$ stationary point using $O(\epsilon^{-6}), O(\epsilon^{-4})$ access to first-order $y^*$-aware oracles. Our upper bounds also apply to standard unbiased first-order oracles, improving the best-known complexity of first-order methods by $O(\epsilon)$ with minimal assumptions. We then provide the matching $\Omega(\epsilon^{-6})$, $\Omega(\epsilon^{-4})$ lower bounds without and with an additional smoothness assumption, respectively. Our results imply that any approach that simulates an algorithm with an $y^*$-aware oracle must suffer the same lower bounds.

Cite

Text

Kwon et al. "On the Complexity of First-Order Methods in Stochastic Bilevel Optimization." International Conference on Machine Learning, 2024.

Markdown

[Kwon et al. "On the Complexity of First-Order Methods in Stochastic Bilevel Optimization." International Conference on Machine Learning, 2024.](https://mlanthology.org/icml/2024/kwon2024icml-complexity/)

BibTeX

@inproceedings{kwon2024icml-complexity,
  title     = {{On the Complexity of First-Order Methods in Stochastic Bilevel Optimization}},
  author    = {Kwon, Jeongyeol and Kwon, Dohyun and Lyu, Hanbaek},
  booktitle = {International Conference on Machine Learning},
  year      = {2024},
  pages     = {25784-25811},
  volume    = {235},
  url       = {https://mlanthology.org/icml/2024/kwon2024icml-complexity/}
}