Cluster-Aware Similarity Diffusion for Instance Retrieval
Jifei Luo, Hantao Yao, Changsheng Xu
TL;DR
The paper tackles diffusion-based re-ranking for instance retrieval, where full-graph diffusion can spread misinformation from outliers and inter-manifold samples. It introduces Cluster-Aware Similarity (CAS) diffusion, combining Bidirectional Similarity Diffusion to perform local, symmetric diffusion and Neighbor-guided Similarity Smooth to enforce neighbor-consistency, with an efficient convex formulation and iterative solutions. The framework yields state-of-the-art re-ranking performance on standard benchmarks for both image retrieval and person re-identification, demonstrated through extensive ablations and comparisons with QE-based, diffusion-based, and learning-based methods. The approach offers a robust, scalable alternative to global diffusion by leveraging local clusters and neighborhood structure, with publicly available code for reproducibility and adoption in practice.
Abstract
Diffusion-based re-ranking is a common method used for retrieving instances by performing similarity propagation in a nearest neighbor graph. However, existing techniques that construct the affinity graph based on pairwise instances can lead to the propagation of misinformation from outliers and other manifolds, resulting in inaccurate results. To overcome this issue, we propose a novel Cluster-Aware Similarity (CAS) diffusion for instance retrieval. The primary concept of CAS is to conduct similarity diffusion within local clusters, which can reduce the influence from other manifolds explicitly. To obtain a symmetrical and smooth similarity matrix, our Bidirectional Similarity Diffusion strategy introduces an inverse constraint term to the optimization objective of local cluster diffusion. Additionally, we have optimized a Neighbor-guided Similarity Smoothing approach to ensure similarity consistency among the local neighbors of each instance. Evaluations in instance retrieval and object re-identification validate the effectiveness of the proposed CAS, our code is publicly available.
