Matching Semantically Similar Non-Identical Objects
Yusuke Marumo, Kazuhiko Kawamoto, Satomi Tanaka, Shigenobu Hirano, Hiroshi Kera
TL;DR
This work tackles pixel-level matching between semantically similar but non-identical objects, addressing challenges from class discrepancy and domain shift. It proposes a plug-and-play Semantic Enhancement Weighting (SEW) module that uses object detector heatmaps and Grad-CAM to reweight sparse descriptors, and a Non-visual Object Pairing mechanism to select appropriate object pairs when multiple objects are present. The approach extends existing sparse matchers without training and is evaluated with a new annotation-free metric, Triangular Matching Consistency (TMC), as well as relative pose accuracy under corruptions; results show notable improvements over strong baselines. The work demonstrates robustness across in-class variations, domain shifts, and cross-domain drawings, offering practical impact for fine-grained, cross-object correspondence tasks and potential downstream applications such as landmark transfer and assembly guidance, while maintaining real-time feasibility through plug-and-play integration.
Abstract
Not identical but similar objects are ubiquitous in our world, ranging from four-legged animals such as dogs and cats to cars of different models and flowers of various colors. This study addresses a novel task of matching such non-identical objects at the pixel level. We propose a weighting scheme of descriptors, Semantic Enhancement Weighting (SEW), that incorporates semantic information from object detectors into existing sparse feature matching methods, extending their targets from identical objects captured from different perspectives to semantically similar objects. The experiments show successful matching between non-identical objects in various cases, including in-class design variations, class discrepancy, and domain shifts (e.g., photo vs. drawing and image corruptions). The code is available at https://github.com/Circ-Leaf/NIOM .
