Efficient Adaptive Online Learning via Frequent Directions

Abstract

By employing time-varying proximal functions, adaptive subgradient methods (ADAGRAD) have improved the regret bound and been widely used in online learning and optimization. However, ADAGRAD with full matrix proximal functions (ADA-FULL) cannot deal with large-scale problems due to the impractical time and space complexities, though it has better performance when gradients are correlated. In this paper, we propose ADA-FD, an efficient variant of ADA-FULL based on a deterministic matrix sketching technique called frequent directions. Following ADA-FULL, we incorporate our ADA-FD into both primal-dual subgradient method and composite mirror descent method to develop two efficient methods. By maintaining and manipulating low-rank matrices, at each iteration, the space complexity is reduced from $O(d^2)$ to $O(\tau d)$ and the time complexity is reduced from $O(d^3)$ to $O(\tau^2d)$, where $d$ is the dimensionality of the data and $\tau \ll d$ is the sketching size. Theoretical analysis reveals that the regret of our methods is close to that of ADA-FULL as long as the outer product matrix of gradients is approximately low-rank. Experimental results show that our ADA-FD is comparable to ADA-FULL and outperforms other state-of-the-art algorithms in online convex optimization as well as in training convolutional neural networks (CNN).

Cite

Text

Wan et al. "Efficient Adaptive Online Learning via Frequent Directions." International Joint Conference on Artificial Intelligence, 2018. doi:10.24963/IJCAI.2018/381

Markdown

[Wan et al. "Efficient Adaptive Online Learning via Frequent Directions." International Joint Conference on Artificial Intelligence, 2018.](https://mlanthology.org/ijcai/2018/wan2018ijcai-efficient/) doi:10.24963/IJCAI.2018/381

BibTeX

@inproceedings{wan2018ijcai-efficient,
  title     = {{Efficient Adaptive Online Learning via Frequent Directions}},
  author    = {Wan, Yuanyu and Wei, Nan and Zhang, Lijun},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2018},
  pages     = {2748-2754},
  doi       = {10.24963/IJCAI.2018/381},
  url       = {https://mlanthology.org/ijcai/2018/wan2018ijcai-efficient/}
}