Householder Sketch for Accurate and Accelerated Least-Mean-Squares Solvers
Abstract
Least-Mean-Squares (\textsc{LMS}) solvers comprise a class of fundamental optimization problems such as linear regression, and regularized regressions such as Ridge, LASSO, and Elastic-Net. Data summarization techniques for big data generate summaries called coresets and sketches to speed up model learning under streaming and distributed settings. For example, \citep{nips2019} design a fast and accurate Caratheodory set on input data to boost the performance of existing \textsc{LMS} solvers. In retrospect, we explore classical Householder transformation as a candidate for sketching and accurately solving LMS problems. We find it to be a simpler, memory-efficient, and faster alternative that always existed to the above strong baseline. We also present a scalable algorithm based on the construction of distributed Householder sketches to solve \textsc{LMS} problem across multiple worker nodes. We perform thorough empirical analysis with large synthetic and real datasets to evaluate the performance of Householder sketch and compare with \citep{nips2019}. Our results show Householder sketch speeds up existing \textsc{LMS} solvers in the scikit-learn library up to $100$x-$400$x. Also, it is $10$x-$100$x faster than the above baseline with similar numerical stability. The distributed algorithm demonstrates linear scalability with a near-negligible communication overhead.
Cite
Text
Dass and Mahapatra. "Householder Sketch for Accurate and Accelerated Least-Mean-Squares Solvers." International Conference on Machine Learning, 2021.Markdown
[Dass and Mahapatra. "Householder Sketch for Accurate and Accelerated Least-Mean-Squares Solvers." International Conference on Machine Learning, 2021.](https://mlanthology.org/icml/2021/dass2021icml-householder/)BibTeX
@inproceedings{dass2021icml-householder,
title = {{Householder Sketch for Accurate and Accelerated Least-Mean-Squares Solvers}},
author = {Dass, Jyotikrishna and Mahapatra, Rabi},
booktitle = {International Conference on Machine Learning},
year = {2021},
pages = {2467-2477},
volume = {139},
url = {https://mlanthology.org/icml/2021/dass2021icml-householder/}
}