To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition
Davide Sferrazza, Gabriele Berton, Gabriele Trivigno, Carlo Masone
TL;DR
This work questions the universal benefit of re-ranking in Visual Place Recognition by showing that state-of-the-art retrieval can saturate benchmarks, sometimes making re-ranking detrimental. It proposes image matching as a verification step to estimate retrieval uncertainty, using inlier counts to decide when re-ranking should be applied. A comprehensive benchmark evaluates many image-matching methods across diverse VPR datasets, revealing that inlier-based uncertainty often correlates with potential gains from re-ranking, especially in challenging conditions, while near-perfect retrieval may degrade with post-processing. The findings advocate for adaptive VPR pipelines that selectively employ matching-based verification, improving robustness and efficiency in real-world localization tasks.
Abstract
Visual Place Recognition (VPR) is a critical task in computer vision, traditionally enhanced by re-ranking retrieval results with image matching. However, recent advancements in VPR methods have significantly improved performance, challenging the necessity of re-ranking. In this work, we show that modern retrieval systems often reach a point where re-ranking can degrade results, as current VPR datasets are largely saturated. We propose using image matching as a verification step to assess retrieval confidence, demonstrating that inlier counts can reliably predict when re-ranking is beneficial. Our findings shift the paradigm of retrieval pipelines, offering insights for more robust and adaptive VPR systems. The code is available at https://github.com/FarInHeight/To-Match-or-Not-to-Match.
