Dimension-Free Adaptive Subgradient Methods with Frequent Directions
Abstract
In this paper, we investigate the acceleration of adaptive subgradient methods through frequent directions (FD), a widely-used matrix sketching technique. The state-of-the-art regret bound exhibits a linear dependence on the dimensionality $d$, leading to unsatisfactory guarantees for high-dimensional problems. Additionally, it suffers from an $O(\tau^2 d)$ time complexity per round, which scales quadratically with the sketching size $\tau$. To overcome these issues, we first propose an algorithm named FTSL, achieving a tighter regret bound that is independent of the dimensionality. The key idea is to integrate FD with adaptive subgradient methods under the primal-dual framework and add the cumulative discarded information of FD back. To reduce its time complexity, we further utilize fast FD to expedite FTSL, yielding a better complexity of $O(\tau d)$ while maintaining the same regret bound. Moreover, to mitigate the computational cost for optimization problems involving matrix variables (e.g., training neural networks), we adapt FD to Shampoo, a popular optimization algorithm that accounts for the structure of decision, and give a novel analysis under the primal-dual framework. Our proposed method obtains an improved dimension-free regret bound. Experimental results have verified the efficiency and effectiveness of our approaches.
Cite
Text
Yang et al. "Dimension-Free Adaptive Subgradient Methods with Frequent Directions." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Yang et al. "Dimension-Free Adaptive Subgradient Methods with Frequent Directions." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/yang2025icml-dimensionfree/)BibTeX
@inproceedings{yang2025icml-dimensionfree,
title = {{Dimension-Free Adaptive Subgradient Methods with Frequent Directions}},
author = {Yang, Sifan and Wan, Yuanyu and Li, Peijia and Wang, Yibo and Zhang, Xiao and Wei, Zhewei and Zhang, Lijun},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {71249-71274},
volume = {267},
url = {https://mlanthology.org/icml/2025/yang2025icml-dimensionfree/}
}