MambaGlue: Fast and Robust Local Feature Matching With Mamba
Kihwan Ryoo, Hyungtae Lim, Hyun Myung
TL;DR
MambaGlue addresses the need for fast, robust local feature matching by hybridizing the Mamba architecture with Transformer-style attention. It introduces two novel components—a MambaAttention mixer that selectively aggregates context over $d$-dimensional feature states $\mathbf{x}_q \in \mathbb{R}^d$ and a deep confidence score regressor that outputs a per-feature score $c_q \in (0,1)$—and an exit test for early stopping to prune unreliable features. Trained in two stages, it achieves higher accuracy with low latency on HPatches, MegaDepth-1500, and Aachen Day-Night compared with strong baselines such as LightGlue and SuperGlue. The results demonstrate practical impact for robust camera pose estimation and visual localization under challenging illumination and viewpoint changes.
Abstract
In recent years, robust matching methods using deep learning-based approaches have been actively studied and improved in computer vision tasks. However, there remains a persistent demand for both robust and fast matching techniques. To address this, we propose a novel Mamba-based local feature matching approach, called MambaGlue, where Mamba is an emerging state-of-the-art architecture rapidly gaining recognition for its superior speed in both training and inference, and promising performance compared with Transformer architectures. In particular, we propose two modules: a) MambaAttention mixer to simultaneously and selectively understand the local and global context through the Mamba-based self-attention structure and b) deep confidence score regressor, which is a multi-layer perceptron (MLP)-based architecture that evaluates a score indicating how confidently matching predictions correspond to the ground-truth correspondences. Consequently, our MambaGlue achieves a balance between robustness and efficiency in real-world applications. As verified on various public datasets, we demonstrate that our MambaGlue yields a substantial performance improvement over baseline approaches while maintaining fast inference speed. Our code will be available on https://github.com/url-kaist/MambaGlue
