CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search

Abstract

The state-of-the-art approximate nearest neighbor search (ANNS) algorithm builds a large proximity graph on the dataset and performs a greedy beam search, which may bring many unnecessary explorations. We develop a novel framework, namely corssing sparse proximity graph (CSPG), based on random partitioning of the dataset. It produces a smaller sparse proximity graph for each partition and routing vectors that bind all the partitions. An efficient two-staged approach is designed for exploring CSPG, with fast approaching and cross-partition expansion. We theoretically prove that CSPG can accelerate the existing graph-based ANNS algorithms by reducing unnecessary explorations. In addition, we conduct extensive experiments on benchmark datasets. The experimental results confirm that the existing graph-based methods can be significantly outperformed by incorporating CSPG, achieving 1.5x to 2x speedups of QPS in almost all recalls.

Cite

Text

Yang et al. "CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search." Neural Information Processing Systems, 2024. doi:10.52202/079017-3275

Markdown

[Yang et al. "CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/yang2024neurips-cspg/) doi:10.52202/079017-3275

BibTeX

@inproceedings{yang2024neurips-cspg,
  title     = {{CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search}},
  author    = {Yang, Ming and Cai, Yuzheng and Zheng, Weiguo},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-3275},
  url       = {https://mlanthology.org/neurips/2024/yang2024neurips-cspg/}
}