AIR-HLoc: Adaptive Retrieved Images Selection for Efficient Visual Localisation
Changkun Liu, Jianhao Jiao, Huajian Huang, Zhengyang Ma, Dimitrios Kanoulas, Tristan Braud
TL;DR
AIR-HLoc introduces adaptive retrieved image selection for hierarchical visual localisation by leveraging query-database global descriptor similarity to adjust the number of retrieved references per query. It demonstrates a strong link between global similarity and local feature match proportion, and defines a per-query score $S(I^q)$ to drive retrieval and a mean localisation improvement per retrieved image (MLIP) metric to quantify contributions. Across Cambridge Landmarks, 7Scenes, and Aachen Day-Night-v1.1, AIR-HLoc achieves state-of-the-art pose accuracy while reducing feature matching cost by up to 30%, with substantial latency gains on edge hardware. This work provides practical insights for per-query $k$ selection and opens avenues for further latency-sensitive localisation, supported by formulas such as $S(I^q) = \frac{1}{3} \sum_{j \in J} \cos(g^{q}, g^{j})$ with $n=3$ and the MLIP definitions $\zeta_T(k)$ and $\zeta_R(k)$.
Abstract
State-of-the-art hierarchical localisation pipelines (HLoc) employ image retrieval (IR) to establish 2D-3D correspondences by selecting the top-$k$ most similar images from a reference database. While increasing $k$ improves localisation robustness, it also linearly increases computational cost and runtime, creating a significant bottleneck. This paper investigates the relationship between global and local descriptors, showing that greater similarity between the global descriptors of query and database images increases the proportion of feature matches. Low similarity queries significantly benefit from increasing $k$, while high similarity queries rapidly experience diminishing returns. Building on these observations, we propose an adaptive strategy that adjusts $k$ based on the similarity between the query's global descriptor and those in the database, effectively mitigating the feature-matching bottleneck. Our approach optimizes processing time without sacrificing accuracy. Experiments on three indoor and outdoor datasets show that AIR-HLoc reduces feature matching time by up to 30\%, while preserving state-of-the-art accuracy. The results demonstrate that AIR-HLoc facilitates a latency-sensitive localisation system.
