$\boldsymbol{Steiner}$-Hardness: A Query Hardness Measure for Graph-Based ANN Indexes
Zeyu Wang, Qitong Wang, Xiaoxing Cheng, Peng Wang, Themis Palpanas, Wei Wang
TL;DR
This work addresses the instability of query performance in graph-based ANN indexes by introducing Steiner-hardness, a graph-native hardness measure defined as the minimum query effort on a representative MRNG graph and computed via connections to Directed Steiner Tree formulations. The authors develop a comprehensive framework for Minimum Effort (ME), adapt it to greedy graph searches, incorporate decision costs, and map the problem to DST and related problems with efficient Steiner-tree solvers. They demonstrate that Steiner-hardness correlates more strongly with actual query effort than prior measures and use unbiased workload generation to reveal robust index behavior, challenging some expectations from old benchmarks. The practical impact lies in providing a principled, graph-aware metric and benchmarks to guide index selection and robustness improvements in real-world high-dimensional similarity search systems.
Abstract
Graph-based indexes have been widely employed to accelerate approximate similarity search of high-dimensional vectors. However, the performance of graph indexes to answer different queries varies vastly, leading to an unstable quality of service for downstream applications. This necessitates an effective measure to test query hardness on graph indexes. Nonetheless, popular distance-based hardness measures like LID lose their effects due to the ignorance of the graph structure. In this paper, we propose $Steiner$-hardness, a novel connection-based graph-native query hardness measure. Specifically, we first propose a theoretical framework to analyze the minimum query effort on graph indexes and then define $Steiner$-hardness as the minimum effort on a representative graph. Moreover, we prove that our $Steiner$-hardness is highly relevant to the classical Directed $Steiner$ Tree (DST) problems. In this case, we design a novel algorithm to reduce our problem to DST problems and then leverage their solvers to help calculate $Steiner$-hardness efficiently. Compared with LID and other similar measures, $Steiner$-hardness shows a significantly better correlation with the actual query effort on various datasets. Additionally, an unbiased evaluation designed based on $Steiner$-hardness reveals new ranking results, indicating a meaningful direction for enhancing the robustness of graph indexes. This paper is accepted by PVLDB 2025.
