Online Composite Optimization Between Stochastic and Adversarial Environments

Wang, Yibo; Chen, Sijia; Jiang, Wei; Yang, Wenhao; Wan, Yuanyu; Zhang, Lijun

doi:10.52202/079017-3005

Online Composite Optimization Between Stochastic and Adversarial Environments

Yibo Wang, Sijia Chen, Wei Jiang, Wenhao Yang, Yuanyu Wan, Lijun Zhang

NeurIPS 2024

doi:10.52202/079017-3005 /neurips/2024/wang2024neurips-online/

Abstract

We study online composite optimization under the Stochastically Extended Adversarial (SEA) model. Specifically, each loss function consists of two parts: a fixed non-smooth and convex regularizer, and a time-varying function which can be chosen either stochastically, adversarially, or in a manner that interpolates between the two extremes. In this setting, we show that for smooth and convex time-varying functions, optimistic composite mirror descent (OptCMD) can obtain an $\mathcal{O}(\sqrt{\sigma_{1:T}^2} + \sqrt{\Sigma_{1:T}^2})$ regret bound, where $\sigma_{1:T}^2$ and $\Sigma_{1:T}^2$ denote the cumulative stochastic variance and the cumulative adversarial variation of time-varying functions, respectively. For smooth and strongly convex time-varying functions, we establish an $\mathcal{O}((\sigma_{\max}^2 + \Sigma_{\max}^2)\log(\sigma_{1:T}^2 + \Sigma_{1:T}^2))$ regret bound, where $\sigma_{\max}^2$ and $\Sigma_{\max}^2$ denote the maximal stochastic variance and the maximal adversarial variation, respectively. For smooth and exp-concave time-varying functions, we achieve an $\mathcal{O}(d \log (\sigma_{1:T}^2 + \Sigma_{1:T}^2))$ bound where $d$ denotes the dimensionality. Moreover, to deal with the unknown function type in practical problems, we propose a multi-level \textit{universal} algorithm that is able to achieve the desirable bounds for three types of time-varying functions simultaneously. It should be noticed that all our findings match existing bounds for the SEA model without the regularizer, which implies that there is \textit{no price} in regret bounds for the benefits gained from the regularizer.

PDF NeurIPS OpenReview Semantic Scholar

Cite

Text

Wang et al. "Online Composite Optimization Between Stochastic and Adversarial Environments." Neural Information Processing Systems, 2024. doi:10.52202/079017-3005

Markdown

[Wang et al. "Online Composite Optimization Between Stochastic and Adversarial Environments." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/wang2024neurips-online/) doi:10.52202/079017-3005

BibTeX

@inproceedings{wang2024neurips-online,
  title     = {{Online Composite Optimization Between Stochastic and Adversarial Environments}},
  author    = {Wang, Yibo and Chen, Sijia and Jiang, Wei and Yang, Wenhao and Wan, Yuanyu and Zhang, Lijun},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-3005},
  url       = {https://mlanthology.org/neurips/2024/wang2024neurips-online/}
}