Debiasing a First-Order Heuristic for Approximate Bi-Level Optimization

Abstract

Approximate bi-level optimization (ABLO) consists of (outer-level) optimization problems whose objectives involve numerical (inner-level) optimization loops. While ABLO has many applications across deep learning, it suffers from time and memory complexity proportional to the length $r$ of its inner optimization loop. To address this complexity, an earlier first-order method (FOM) was proposed as a heuristic that omits second-derivative terms, yielding significant speed gains and requiring only constant memory. Despite FOM's popularity, there is a lack of theoretical understanding of its convergence properties. We contribute by theoretically characterizing FOM's gradient bias under mild assumptions. We further demonstrate a rich family of examples in which FOM-based SGD does not converge to a stationary point of the ABLO objective. We address this concern by proposing an unbiased FOM (UFOM) that enjoys constant memory complexity as a function of $r$. We characterize the introduced time-variance tradeoff, demonstrate convergence bounds, and find an optimal UFOM for a given ABLO problem. Finally, we propose an efficient adaptive UFOM scheme.
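To make the FOM heuristic concrete, below is a minimal, hypothetical sketch (not the authors' code) of a MAML-style ABLO instance in JAX, where the outer variable phi is the inner-loop initialization; the names f_inner, g_outer, inner_loop, and the values of r and lr are illustrative assumptions. The exact hypergradient differentiates through the unrolled inner loop, which requires second derivatives and cost growing with r, while the FOM approximation drops the dependence of the inner trajectory on phi and keeps only the outer gradient at the inner solution.

# Hypothetical illustration of the first-order heuristic (FOM) on a
# MAML-style bi-level problem; names and constants are assumptions.
import jax
import jax.numpy as jnp

r, lr = 5, 0.1  # inner-loop length and step size (illustrative values)

def f_inner(theta, x):
    # Inner objective, minimized over theta.
    return jnp.sum((theta - x) ** 2)

def g_outer(theta, y):
    # Outer objective, evaluated at the inner solution.
    return jnp.sum((theta - y) ** 2)

def inner_loop(phi, x):
    # r steps of gradient descent on the inner objective, starting from phi.
    theta = phi
    for _ in range(r):
        theta = theta - lr * jax.grad(f_inner)(theta, x)
    return theta

def exact_hypergrad(phi, x, y):
    # Exact gradient: backpropagate through the whole unrolled loop
    # (involves second derivatives of f_inner, O(r) time/memory).
    return jax.grad(lambda p: g_outer(inner_loop(p, x), y))(phi)

def fom_hypergrad(phi, x, y):
    # FOM heuristic: treat the inner trajectory as constant w.r.t. phi,
    # i.e. omit all second-derivative terms (constant memory in r).
    theta_r = jax.lax.stop_gradient(inner_loop(phi, x))
    return jax.grad(g_outer)(theta_r, y)

phi = jnp.zeros(3)
x, y = jnp.ones(3), 2.0 * jnp.ones(3)
print(exact_hypergrad(phi, x, y))  # unbiased but expensive
print(fom_hypergrad(phi, x, y))    # cheap but, in general, biased

The gap between the two printed gradients is exactly the bias that the paper characterizes and that UFOM removes while keeping memory constant in r.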

Cite

Text

Likhosherstov et al. "Debiasing a First-Order Heuristic for Approximate Bi-Level Optimization." International Conference on Machine Learning, 2021.

Markdown

[Likhosherstov et al. "Debiasing a First-Order Heuristic for Approximate Bi-Level Optimization." International Conference on Machine Learning, 2021.](https://mlanthology.org/icml/2021/likhosherstov2021icml-debiasing/)

BibTeX

@inproceedings{likhosherstov2021icml-debiasing,
  title     = {{Debiasing a First-Order Heuristic for Approximate Bi-Level Optimization}},
  author    = {Likhosherstov, Valerii and Song, Xingyou and Choromanski, Krzysztof and Davis, Jared Q and Weller, Adrian},
  booktitle = {International Conference on Machine Learning},
  year      = {2021},
  pages     = {6621--6630},
  volume    = {139},
  url       = {https://mlanthology.org/icml/2021/likhosherstov2021icml-debiasing/}
}