Pass-Efficient Randomized SVD with Boosted Accuracy
Abstract
Singular value decomposition (SVD) is a widely used tool in data analysis and numerical linear algebra. Computing truncated SVD of a very large matrix encounters difficulty due to excessive time and memory cost. In this work, we aim to tackle this difficulty and enable accurate SVD computation for the large data which cannot be loaded into memory. We first propose a randomized SVD algorithm with fewer passes over the matrix. It reduces the passes in the basic randomized SVD by half, almost not sacrificing accuracy. Then, a shifted power iteration technique is proposed to improve the accuracy of result, where a dynamic scheme of updating the shift value in each power iteration is included. Finally, collaborating the proposed techniques with several accelerating skills, we develop a P ass- e fficient r andomized SVD (PerSVD) algorithm for efficient and accurate treatment of large data stored on hard disk. Experiments on synthetic and real-world data validate that the proposed techniques largely improve the accuracy of randomized SVD with same number of passes over the matrix. With 3 or 4 passes over the data, PerSVD is able to reduce the error of SVD result by three or four orders of magnitude compared with the basic randomized SVD and single-pass SVD algorithms, with similar or less runtime and less memory usage.
Cite
Text
Feng et al. "Pass-Efficient Randomized SVD with Boosted Accuracy." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2022. doi:10.1007/978-3-031-26387-3_1Markdown
[Feng et al. "Pass-Efficient Randomized SVD with Boosted Accuracy." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2022.](https://mlanthology.org/ecmlpkdd/2022/feng2022ecmlpkdd-passefficient/) doi:10.1007/978-3-031-26387-3_1BibTeX
@inproceedings{feng2022ecmlpkdd-passefficient,
title = {{Pass-Efficient Randomized SVD with Boosted Accuracy}},
author = {Feng, Xu and Yu, Wenjian and Xie, Yuyang},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2022},
pages = {3-20},
doi = {10.1007/978-3-031-26387-3_1},
url = {https://mlanthology.org/ecmlpkdd/2022/feng2022ecmlpkdd-passefficient/}
}