Ensemble Quadratic Assignment Network for Graph Matching
Haoru Tan, Chuang Wang, Sitong Wu, Xu-Yao Zhang, Fei Yin, Cheng-Lin Liu
TL;DR
The paper presents EQAN, a multi-channel GNN for graph matching that ensembles traditional QAP solvers within an end-to-end differentiable framework. By modeling each solver as a channel on the association graph and enabling inter-channel communication via 1×1 convolutions, EQAN achieves robust performance across geometric and semantic matching tasks while maintaining scalability through a random-sampling strategy. The authors establish a differentiable proximal graph matching base, extend it to multi-channel ensembles, and demonstrate strong empirical gains over both traditional methods and prior data-driven approaches, including few-shot 3D shape classification. The approach yields improved robustness to noise, outliers, and rotations, with practical inference times on graphs with thousands of nodes. Theoretical convergence of the proximal solver and extensive ablations support the design choices and scalability claims.
Abstract
Graph matching is a commonly used technique in computer vision and pattern recognition. Recent data-driven approaches have improved the graph matching accuracy remarkably, whereas some traditional algorithm-based methods are more robust to feature noises, outlier nodes, and global transformation (e.g.~rotation). In this paper, we propose a graph neural network (GNN) based approach to combine the advantages of data-driven and traditional methods. In the GNN framework, we transform traditional graph-matching solvers as single-channel GNNs on the association graph and extend the single-channel architecture to the multi-channel network. The proposed model can be seen as an ensemble method that fuses multiple algorithms at every iteration. Instead of averaging the estimates at the end of the ensemble, in our approach, the independent iterations of the ensembled algorithms exchange their information after each iteration via a 1x1 channel-wise convolution layer. Experiments show that our model improves the performance of traditional algorithms significantly. In addition, we propose a random sampling strategy to reduce the computational complexity and GPU memory usage, so the model applies to matching graphs with thousands of nodes. We evaluate the performance of our method on three tasks: geometric graph matching, semantic feature matching, and few-shot 3D shape classification. The proposed model performs comparably or outperforms the best existing GNN-based methods.
