Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion
Guoyuan An, Yuchi Huo, Sung-Eui Yoon
TL;DR
The paper tackles the challenge of fast and accurate pixel retrieval in large image databases, where diffusion on scalar graph edges can mispropagate spatial information. It introduces a spatially aware hypergraph diffusion (HD) built on a kNN image graph, with inter-image and intra-image hyperedges to propagate local spatial cues offline and a community-selection mechanism to predict retrieval uncertainty online. HD achieves state-of-the-art image-level and pixel-level performance on ROxford/Paris benchmarks, while offering strong speed and memory efficiency, and its convergence follows the form $( ext{I}- ext{P}')^{-1} ext{Y}^0$. The work also demonstrates practical gains in real-world retrieval by reducing the reliance on expensive spatial verification through uncertainty-driven initialization, making it attractive for scalable search systems.
Abstract
This paper presents a novel method designed to enhance the efficiency and accuracy of both image retrieval and pixel retrieval. Traditional diffusion methods struggle to propagate spatial information effectively in conventional graphs due to their reliance on scalar edge weights. To overcome this limitation, we introduce a hypergraph-based framework, uniquely capable of efficiently propagating spatial information using local features during query time, thereby accurately retrieving and localizing objects within a database. Additionally, we innovatively utilize the structural information of the image graph through a technique we term "community selection". This approach allows for the assessment of the initial search result's uncertainty and facilitates an optimal balance between accuracy and speed. This is particularly crucial in real-world applications where such trade-offs are often necessary. Our experimental results, conducted on the (P)ROxford and (P)RParis datasets, demonstrate the significant superiority of our method over existing diffusion techniques. We achieve state-of-the-art (SOTA) accuracy in both image-level and pixel-level retrieval, while also maintaining impressive processing speed. This dual achievement underscores the effectiveness of our hypergraph-based framework and community selection technique, marking a notable advancement in the field of content-based image retrieval.
