Multi-Block Min-Max Bilevel Optimization with Applications in Multi-Task Deep AUC Maximization
Abstract
In this paper, we study multi-block min-max bilevel optimization problems, where the upper level is non-convex strongly-concave minimax objective and the lower level is a strongly convex objective, and there are multiple blocks of dual variables and lower level problems. Due to the intertwined multi-block min-max bilevel structure, the computational cost at each iteration could be prohibitively high, especially with a large number of blocks. To tackle this challenge, we present two single-loop randomized stochastic algorithms, which require updates for only a constant number of blocks at each iteration. Under some mild assumptions on the problem, we establish their sample complexity of $\mathcal{O}(1/\epsilon^4)$ for finding an $\epsilon$-stationary point. This matches the optimal complexity order for solving stochastic nonconvex optimization under a general unbiased stochastic oracle model. Moreover, we provide two applications of the proposed method in multi-task deep AUC (area under ROC curve) maximization. Experimental results validate our theory and demonstrate the effectiveness of our method.
Cite
Text
Hu et al. "Multi-Block Min-Max Bilevel Optimization with Applications in Multi-Task Deep AUC Maximization." Neural Information Processing Systems, 2022.Markdown
[Hu et al. "Multi-Block Min-Max Bilevel Optimization with Applications in Multi-Task Deep AUC Maximization." Neural Information Processing Systems, 2022.](https://mlanthology.org/neurips/2022/hu2022neurips-multiblock/)BibTeX
@inproceedings{hu2022neurips-multiblock,
title = {{Multi-Block Min-Max Bilevel Optimization with Applications in Multi-Task Deep AUC Maximization}},
author = {Hu, Quanqi and Zhong, Yongjian and Yang, Tianbao},
booktitle = {Neural Information Processing Systems},
year = {2022},
url = {https://mlanthology.org/neurips/2022/hu2022neurips-multiblock/}
}