Online Composite Optimization Between Stochastic and Adversarial Environments
Abstract
We study online composite optimization under the Stochastically Extended Adversarial (SEA) model. Specifically, each loss function consists of two parts: a fixed non-smooth and convex regularizer, and a time-varying function which can be chosen either stochastically, adversarially, or in a manner that interpolates between the two extremes. In this setting, we show that for smooth and convex time-varying functions, optimistic composite mirror descent (OptCMD) can obtain an $\mathcal{O}(\sqrt{\sigma_{1:T}^2} + \sqrt{\Sigma_{1:T}^2})$ regret bound, where $\sigma_{1:T}^2$ and $\Sigma_{1:T}^2$ denote the cumulative stochastic variance and the cumulative adversarial variation of time-varying functions, respectively. For smooth and strongly convex time-varying functions, we establish an $\mathcal{O}((\sigma_{\max}^2 + \Sigma_{\max}^2)\log(\sigma_{1:T}^2 + \Sigma_{1:T}^2))$ regret bound, where $\sigma_{\max}^2$ and $\Sigma_{\max}^2$ denote the maximal stochastic variance and the maximal adversarial variation, respectively. For smooth and exp-concave time-varying functions, we achieve an $\mathcal{O}(d \log (\sigma_{1:T}^2 + \Sigma_{1:T}^2))$ bound where $d$ denotes the dimensionality. Moreover, to deal with the unknown function type in practical problems, we propose a multi-level \textit{universal} algorithm that is able to achieve the desirable bounds for three types of time-varying functions simultaneously. It should be noticed that all our findings match existing bounds for the SEA model without the regularizer, which implies that there is \textit{no price} in regret bounds for the benefits gained from the regularizer.
Cite
Text
Wang et al. "Online Composite Optimization Between Stochastic and Adversarial Environments." Neural Information Processing Systems, 2024. doi:10.52202/079017-3005Markdown
[Wang et al. "Online Composite Optimization Between Stochastic and Adversarial Environments." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/wang2024neurips-online/) doi:10.52202/079017-3005BibTeX
@inproceedings{wang2024neurips-online,
title = {{Online Composite Optimization Between Stochastic and Adversarial Environments}},
author = {Wang, Yibo and Chen, Sijia and Jiang, Wei and Yang, Wenhao and Wan, Yuanyu and Zhang, Lijun},
booktitle = {Neural Information Processing Systems},
year = {2024},
doi = {10.52202/079017-3005},
url = {https://mlanthology.org/neurips/2024/wang2024neurips-online/}
}