Stability and Generalization for Randomized Coordinate Descent

IJCAI 2021 pp. 3104-3110

doi:10.24963/IJCAI.2021/427 /ijcai/2021/wang2021ijcai-stability/

Abstract

Randomized coordinate descent (RCD) is a popular optimization algorithm with wide applications in various machine learning problems, which motivates a lot of theoretical analysis on its convergence behavior. As a comparison, there is no work studying how the models trained by RCD would generalize to test examples. In this paper, we initialize the generalization analysis of RCD by leveraging the powerful tool of algorithmic stability. We establish argument stability bounds of RCD for both convex and strongly convex objectives, from which we develop optimal generalization bounds by showing how to early-stop the algorithm to tradeoff the estimation and optimization. Our analysis shows that RCD enjoys better stability as compared to stochastic gradient descent.

PDF IJCAI Semantic Scholar

Cite

Text

Wang et al. "Stability and Generalization for Randomized Coordinate Descent." International Joint Conference on Artificial Intelligence, 2021. doi:10.24963/IJCAI.2021/427

Markdown

[Wang et al. "Stability and Generalization for Randomized Coordinate Descent." International Joint Conference on Artificial Intelligence, 2021.](https://mlanthology.org/ijcai/2021/wang2021ijcai-stability/) doi:10.24963/IJCAI.2021/427

BibTeX

@inproceedings{wang2021ijcai-stability,
  title     = {{Stability and Generalization for Randomized Coordinate Descent}},
  author    = {Wang, Puyu and Wu, Liang and Lei, Yunwen},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {3104-3110},
  doi       = {10.24963/IJCAI.2021/427},
  url       = {https://mlanthology.org/ijcai/2021/wang2021ijcai-stability/}
}