Semantic Correspondence: Unified Benchmarking and a Strong Baseline
Kaiyan Zhang, Xinghui Li, Jingyi Lu, Kai Han
TL;DR
This work provides a holistic survey and benchmarking framework for semantic correspondence, classifying methods into handcrafted descriptors, architectural improvements, and training strategies. It demonstrates that backbone quality and fine-tuning are the dominant factors in performance, and shows that a simple baseline combining strong backbones with targeted refinement achieves state-of-the-art results on multiple benchmarks. The study offers extensive controlled experiments across datasets and resolutions, and proposes a unified benchmark to enable fair comparisons. The findings emphasize backbone-driven gains and point to future directions in foundation-model adaptation and more scalable supervisory signals for semantic matching.
Abstract
Establishing semantic correspondence is a challenging task in computer vision, aiming to match keypoints with the same semantic information across different images. Benefiting from the rapid development of deep learning, remarkable progress has been made over the past decade. However, a comprehensive review and analysis of this task remains absent. In this paper, we present the first extensive survey of semantic correspondence methods. We first propose a taxonomy to classify existing methods based on the type of their method designs. These methods are then categorized accordingly, and we provide a detailed analysis of each approach. Furthermore, we aggregate and summarize the results of methods in literature across various benchmarks into a unified comparative table, with detailed configurations to highlight performance variations. Additionally, to provide a detailed understanding on existing methods for semantic matching, we thoroughly conduct controlled experiments to analyse the effectiveness of the components of different methods. Finally, we propose a simple yet effective baseline that achieves state-of-the-art performance on multiple benchmarks, providing a solid foundation for future research in this field. We hope this survey serves as a comprehensive reference and consolidated baseline for future development. Code is publicly available at: https://github.com/Visual-AI/Semantic-Correspondence.
