Unsupervised Deep Graph Matching Based on Cycle Consistency
Siddharth Tourani, Carsten Rother, Muhammad Haris Khan, Bogdan Savchynskyy
TL;DR
The paper tackles unsupervised deep graph matching for keypoint correspondence by introducing a discrete cycle-consistency loss as supervision signal. It leverages black-box differentiation to backprop through combinatorial solvers for LAP and QAP, enabling end-to-end training with arbitrary neural architectures. A flexible network architecture combines a VGG16 backbone, SplineCNN refinements, and self- and cross-attention to produce unary and edge costs for matching, achieving state-of-the-art performance on standard benchmarks without ground-truth matches. Empirical results on Pascal VOC, Willow, and SPair-71K demonstrate robust unsupervised performance and the importance of attention mechanisms, with ablations highlighting the value of cycle-consistency and solver-agnostic design.
Abstract
We contribute to the sparsely populated area of unsupervised deep graph matching with application to keypoint matching in images. Contrary to the standard \emph{supervised} approach, our method does not require ground truth correspondences between keypoint pairs. Instead, it is self-supervised by enforcing consistency of matchings between images of the same object category. As the matching and the consistency loss are discrete, their derivatives cannot be straightforwardly used for learning. We address this issue in a principled way by building our method upon the recent results on black-box differentiation of combinatorial solvers. This makes our method exceptionally flexible, as it is compatible with arbitrary network architectures and combinatorial solvers. Our experimental evaluation suggests that our technique sets a new state-of-the-art for unsupervised graph matching.
