A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm

Wenjie Zheng, Aurélien Bellet, Patrick Gallinari

MLJ 2018 pp. 1457-1475

doi:10.1007/S10994-018-5713-5 /mlj/2018/zheng2018mlj-distributed/

Abstract

We consider the problem of learning a high-dimensional but low-rank matrix from a large-scale dataset distributed over several machines, where low-rankness is enforced by a convex trace norm constraint. We propose DFW-Trace, a distributed Frank–Wolfe algorithm which leverages the low-rank structure of its updates to achieve efficiency in time, memory and communication usage. The step at the heart of DFW-Trace is solved approximately using a distributed version of the power method. We provide a theoretical analysis of the convergence of DFW-Trace, showing that we can ensure sublinear convergence in expectation to an optimal solution with few power iterations per epoch. We implement DFW-Trace in the Apache Spark distributed programming framework and validate the usefulness of our approach on synthetic and real data, including the ImageNet dataset with high-dimensional features extracted from a deep neural network.

PDF MLJ Semantic Scholar

Cite

Text

Zheng et al. "A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm." Machine Learning, 2018. doi:10.1007/S10994-018-5713-5

Markdown

[Zheng et al. "A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm." Machine Learning, 2018.](https://mlanthology.org/mlj/2018/zheng2018mlj-distributed/) doi:10.1007/S10994-018-5713-5

BibTeX

@article{zheng2018mlj-distributed,
  title     = {{A Distributed Frank-Wolfe Framework for Learning Low-Rank Matrices with the Trace Norm}},
  author    = {Zheng, Wenjie and Bellet, Aurélien and Gallinari, Patrick},
  journal   = {Machine Learning},
  year      = {2018},
  pages     = {1457-1475},
  doi       = {10.1007/S10994-018-5713-5},
  volume    = {107},
  url       = {https://mlanthology.org/mlj/2018/zheng2018mlj-distributed/}
}