Graph-Based Approximate Nearest Neighbor Search Revisited: Theoretical Analysis and Optimization
Xinran Ma, Zhaoqi Zhou, Chuan Zhou, Qi Meng, Zaijiu Shang, Guoliang Li, Zhiming Ma
TL;DR
This work addresses graph-based ANNS by focusing on Sparse Neighborhood Graph (SNG) and bridging theory with practice. It develops a martingale-based framework to analyze SNG construction, proving a tight out-degree bound of $O(n^{2/3+\epsilon})$ and a logarithmic search path length $O(\log n)$, while introducing a marginal optimality principle to select the truncation parameter $R$. The resulting principled optimization yields significant reductions in index construction time (2x–9x in some settings) and competitive or improved query performance compared to standard binary-search tuning. Empirical validation on large-scale benchmarks and a non-uniform GMM dataset confirms the theoretical predictions and demonstrates robust scalability and efficiency gains for practical deployment in systems like DiskANN/HNSW-like pipelines.
Abstract
Graph-based approaches to approximate nearest neighbor search (ANNS) have achieved remarkable success in enabling fast, high-recall retrieval on billion-scale vector datasets. Among them, the Sparse Neighborhood Graph (SNG) has emerged as a widely adopted graph structure due to its superior search performance. However, the theoretical understanding of SNG remains limited, leading to reliance on heuristic-based and often suboptimal truncation strategies. In this work, we aim to bridge the gap between theory and practice by providing formal guarantees for graph-based ANNS methods and proposing principled optimization strategies for the truncation parameter. By characterizing the index construction process through martingale-based analysis, we show that the degree of the index graph is $O(n^{2/3+ε})$, where $ε$ is an arbitrarily small constant. Furthermore, we prove that the expected search path length during query processing is $O(\log n)$. Based on these theoretical insights, we introduce a novel and principled method for selecting the truncation parameter $R$ in SNG. Experimental results demonstrate that our method achieves comparable or superior performance in terms of query latency and Recall@10 compared to commonly used binary search heuristics, while yielding 2x to 9x speedups in overall index construction.
