Online Learning with Optimism and Delay
Abstract
Inspired by the demands of real-time climate and weather forecasting, we develop optimistic online learning algorithms that require no parameter tuning and have optimal regret guarantees under delayed feedback. Our algorithms—DORM, DORM+, and AdaHedgeD—arise from a novel reduction of delayed online learning to optimistic online learning that reveals how optimistic hints can mitigate the regret penalty caused by delay. We pair this delay-as-optimism perspective with a new analysis of optimistic learning that exposes its robustness to hinting errors and a new meta-algorithm for learning effective hinting strategies in the presence of delay. We conclude by benchmarking our algorithms on four subseasonal climate forecasting tasks, demonstrating low regret relative to state-of-the-art forecasting models.
Cite
Text
Flaspohler et al. "Online Learning with Optimism and Delay." International Conference on Machine Learning, 2021.Markdown
[Flaspohler et al. "Online Learning with Optimism and Delay." International Conference on Machine Learning, 2021.](https://mlanthology.org/icml/2021/flaspohler2021icml-online/)BibTeX
@inproceedings{flaspohler2021icml-online,
title = {{Online Learning with Optimism and Delay}},
author = {Flaspohler, Genevieve E and Orabona, Francesco and Cohen, Judah and Mouatadid, Soukayna and Oprescu, Miruna and Orenstein, Paulo and Mackey, Lester},
booktitle = {International Conference on Machine Learning},
year = {2021},
pages = {3363-3373},
volume = {139},
url = {https://mlanthology.org/icml/2021/flaspohler2021icml-online/}
}