Stability and Generalization for Randomized Coordinate Descent

Abstract

Randomized coordinate descent (RCD) is a popular optimization algorithm with wide applications in various machine learning problems, which motivates a lot of theoretical analysis on its convergence behavior. As a comparison, there is no work studying how the models trained by RCD would generalize to test examples. In this paper, we initialize the generalization analysis of RCD by leveraging the powerful tool of algorithmic stability. We establish argument stability bounds of RCD for both convex and strongly convex objectives, from which we develop optimal generalization bounds by showing how to early-stop the algorithm to tradeoff the estimation and optimization. Our analysis shows that RCD enjoys better stability as compared to stochastic gradient descent.

Cite

Text

Wang et al. "Stability and Generalization for Randomized Coordinate Descent." International Joint Conference on Artificial Intelligence, 2021. doi:10.24963/IJCAI.2021/427

Markdown

[Wang et al. "Stability and Generalization for Randomized Coordinate Descent." International Joint Conference on Artificial Intelligence, 2021.](https://mlanthology.org/ijcai/2021/wang2021ijcai-stability/) doi:10.24963/IJCAI.2021/427

BibTeX

@inproceedings{wang2021ijcai-stability,
  title     = {{Stability and Generalization for Randomized Coordinate Descent}},
  author    = {Wang, Puyu and Wu, Liang and Lei, Yunwen},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {3104-3110},
  doi       = {10.24963/IJCAI.2021/427},
  url       = {https://mlanthology.org/ijcai/2021/wang2021ijcai-stability/}
}