MBMamba: When Memory Buffer Meets Mamba for Structure-Aware Image Deblurring
Hu Gao, Xiaoning Lei, Xichen Xu, Depeng Dang, Lizhuang Ma
TL;DR
MBMamba introduces MemVSSM, a memory-buffered, chunk-wise vision state space module, and an Ising-inspired regularization loss to address local detail loss and structural incoherence in Mamba-based deblurring. By dividing features into chunks with a FIFO memory bank and cross-attention fusion, MBMamba preserves both global context and local structure without adding extra scans, maintaining real-time efficiency. The Ising loss further enforces spatial coherence across neighboring pixels, improving texture and edge preservation. Experimental results on GoPro, HIDE, and RealBlur show state-of-the-art PSNR/SSIM gains with substantial reductions in compute and faster inference, validating both performance and practicality of the approach.
Abstract
The Mamba architecture has emerged as a promising alternative to CNNs and Transformers for image deblurring. However, its flatten-and-scan strategy often results in local pixel forgetting and channel redundancy, limiting its ability to effectively aggregate 2D spatial information. Although existing methods mitigate this by modifying the scan strategy or incorporating local feature modules, it increase computational complexity and hinder real-time performance. In this paper, we propose a structure-aware image deblurring network without changing the original Mamba architecture. Specifically, we design a memory buffer mechanism to preserve historical information for later fusion, enabling reliable modeling of relevance between adjacent features. Additionally, we introduce an Ising-inspired regularization loss that simulates the energy minimization of the physical system's "mutual attraction" between pixels, helping to maintain image structure and coherence. Building on this, we develop MBMamba. Experimental results show that our method outperforms state-of-the-art approaches on widely used benchmarks.
