Prune, Don't Rebuild: Efficiently Tuning $α$-Reachable Graphs for Nearest Neighbor Search
Tian Zhang, Ashwin Padaki, Jiaming Liang, Zack Ives, Erik Waingarten
TL;DR
RP-Tuning presents a practical, post-hoc pruning routine to adjust DiskANN's $\alpha$-reachable graphs without rebuilding, enabling rapid exploration of accuracy-latency-size trade-offs. Grounded in the $\alpha$-reachability framework, it proves worst-case guarantees for unsorted and sorted RobustPrune under general and Euclidean metrics, with Euclidean metrics offering improved bounds. Empirically, RP-Tuning accelerates DiskANN tuning up to $43\times$ on four public datasets and yields superior recall-per-QPS frontiers compared with rebuilds, validating both theory and practice. This approach enables index distillation across configurations tailored to diverse hardware and latency requirements while preserving high-quality navigation structure.
Abstract
Vector similarity search is an essential primitive in modern AI and ML applications. Most vector databases adopt graph-based approximate nearest neighbor (ANN) search algorithms, such as DiskANN (Subramanya et al., 2019), which have demonstrated state-of-the-art empirical performance. DiskANN's graph construction is governed by a reachability parameter $α$, which gives a trade-off between construction time, query time, and accuracy. However, adaptively tuning this trade-off typically requires rebuilding the index for different $α$ values, which is prohibitive at scale. In this work, we propose RP-Tuning, an efficient post-hoc routine, based on DiskANN's pruning step, to adjust the $α$ parameter without reconstructing the full index. Within the $α$-reachability framework of prior theoretical works (Indyk and Xu, 2023; Gollapudi et al., 2025), we prove that pruning an initially $α$-reachable graph with RP-Tuning preserves worst-case reachability guarantees in general metrics and improved guarantees in Euclidean metrics. Empirically, we show that RP-Tuning accelerates DiskANN tuning on four public datasets by up to $43\times$ with negligible overhead.
