Guarantees for Tuning the Step Size Using a Learning-to-Learn Approach

Abstract

Choosing the right parameters for optimization algorithms is often the key to their success in practice. Solving this problem using a learning-to-learn approach—using meta-gradient descent on a meta-objective based on the trajectory that the optimizer generates—was recently shown to be effective. However, the meta-optimization problem is difficult. In particular, the meta-gradient can often explode/vanish, and the learned optimizer may not have good generalization performance if the meta-objective is not chosen carefully. In this paper we give meta-optimization guarantees for the learning-to-learn approach on a simple problem of tuning the step size for quadratic loss. Our results show that the naïve objective suffers from meta-gradient explosion/vanishing problem. Although there is a way to design the meta-objective so that the meta-gradient remains polynomially bounded, computing the meta-gradient directly using backpropagation leads to numerical issues. We also characterize when it is necessary to compute the meta-objective on a separate validation set to ensure the generalization performance of the learned optimizer. Finally, we verify our results empirically and show that a similar phenomenon appears even for more complicated learned optimizers parametrized by neural networks.

Cite

Text

Wang et al. "Guarantees for Tuning the Step Size Using a Learning-to-Learn Approach." International Conference on Machine Learning, 2021.

Markdown

[Wang et al. "Guarantees for Tuning the Step Size Using a Learning-to-Learn Approach." International Conference on Machine Learning, 2021.](https://mlanthology.org/icml/2021/wang2021icml-guarantees/)

BibTeX

@inproceedings{wang2021icml-guarantees,
  title     = {{Guarantees for Tuning the Step Size Using a Learning-to-Learn Approach}},
  author    = {Wang, Xiang and Yuan, Shuai and Wu, Chenwei and Ge, Rong},
  booktitle = {International Conference on Machine Learning},
  year      = {2021},
  pages     = {10981-10990},
  volume    = {139},
  url       = {https://mlanthology.org/icml/2021/wang2021icml-guarantees/}
}