Burst Image Super-Resolution with Mamba
Ozan Unal, Steven Marty, Dengxin Dai
TL;DR
BurstMamba addresses the inefficiency of burst image super-resolution by decoupling keyframe SR from burst-based subpixel priors using a Mamba-based backbone with linear time complexity. The method introduces two innovations: Optical Flow-Based Serialization (OFS), which aligns burst information only during inter-frame state updates to preserve high-frequency details, and a Wavelet-based State-Space Update (ψS6) that prioritizes high-frequency features for burst-to-keyframe information transfer. Empirical results on SyntheticSR, RealBSR-RGB, and RealBSR-RAW show state-of-the-art performance, with ablations confirming substantial gains from the temporal subpixel prior, OFS, and ψS6, and demonstrated robustness to varying burst lengths. The approach offers scalable deployment flexibility and potential applicability to related burst enhancement tasks such as deblurring and denoising.
Abstract
Burst image super-resolution (BISR) aims to enhance the resolution of a keyframe by leveraging information from multiple low-resolution images captured in quick succession. In the deep learning era, BISR methods have evolved from fully convolutional networks to transformer-based architectures, which, despite their effectiveness, suffer from the quadratic complexity of self-attention. We see Mamba as the next natural step in the evolution of this field, offering a comparable global receptive field and selective information routing with only linear time complexity. In this work, we introduce BurstMamba, a Mamba-based architecture for BISR. Our approach decouples the task into two specialized branches: a spatial module for keyframe super-resolution and a temporal module for subpixel prior extraction, striking a balance between computational efficiency and burst information integration. To further enhance burst processing with Mamba, we propose two novel strategies: (i) optical flow-based serialization, which aligns burst sequences only during state updates to preserve subpixel details, and (ii) a wavelet-based reparameterization of the state-space update rules, prioritizing high-frequency features for improved burst-to-keyframe information passing. Our framework achieves SOTA performance on public benchmarks of SyntheticSR, RealBSR-RGB, and RealBSR-RAW.
