LoRANN: Low-Rank Matrix Factorization for Approximate Nearest Neighbor Search
Abstract
Approximate nearest neighbor (ANN) search is a key component in many modern machine learning pipelines; recent use cases include retrieval-augmented generation (RAG) and vector databases. Clustering-based ANN algorithms, that use score computation methods based on product quantization (PQ), are often used in industrial-scale applications due to their scalability and suitability for distributed and disk-based implementations. However, they have slower query times than the leading graph-based ANN algorithms. In this work, we propose a new supervised score computation method based on the observation that inner product approximation is a multivariate (multi-output) regression problem that can be solved efficiently by reduced-rank regression. Our experiments show that on modern high-dimensional data sets, the proposed reduced-rank regression (RRR) method is superior to PQ in both query latency and memory usage. We also introduce LoRANN, a clustering-based ANN library that leverages the proposed score computation method. LoRANN is competitive with the leading graph-based algorithms and outperforms the state-of-the-art GPU ANN methods on high-dimensional data sets.
Cite
Text
Jääsaari et al. "LoRANN: Low-Rank Matrix Factorization for Approximate Nearest Neighbor Search." Neural Information Processing Systems, 2024. doi:10.52202/079017-3242Markdown
[Jääsaari et al. "LoRANN: Low-Rank Matrix Factorization for Approximate Nearest Neighbor Search." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/jaasaari2024neurips-lorann/) doi:10.52202/079017-3242BibTeX
@inproceedings{jaasaari2024neurips-lorann,
title = {{LoRANN: Low-Rank Matrix Factorization for Approximate Nearest Neighbor Search}},
author = {Jääsaari, Elias and Hyvönen, Ville and Roos, Teemu},
booktitle = {Neural Information Processing Systems},
year = {2024},
doi = {10.52202/079017-3242},
url = {https://mlanthology.org/neurips/2024/jaasaari2024neurips-lorann/}
}