S2WMamba: A Spectral-Spatial Wavelet Mamba for Pansharpening
Haoyu Zhang, Junhan Luo, Yugang Cao, Siran Peng, Jie Huang, Liang-Jian Deng
TL;DR
S2WMamba addresses the persistent spatial–spectral trade-off in pansharpening by disentangling frequency information in the wavelet domain. It introduces a dual-branch framework with a 2D Haar DWT-guided Spectral Branch and a channel-wise 1D Haar DWT-guided Spatial Branch, coupled through FMamba cross-modal interactions and a Multi-Scale Dynamic Gate for adaptive fusion. The approach demonstrates state-of-the-art or competitive performance on WV3, GF2, and QB with strong efficiency, supported by extensive ablations that justify the dual-branch design, Mamba backbone, and fusion strategy. The work offers a principled wavelet-based fusion paradigm that leverages long-range modeling for robust, high-fidelity HRMS pansharpening with practical implications for remote sensing applications.
Abstract
Pansharpening fuses a high-resolution panchromatic (PAN) image with a low-resolution multispectral (LRMS) image to produce a high-resolution multispectral (HRMS) image. A key difficulty is that jointly processing PAN and MS often entangles spatial detail with spectral fidelity. We propose S2WMamba, which explicitly disentangles frequency information and then performs lightweight cross-modal interaction. Concretely, a 2D Haar DWT is applied to PAN to localize spatial edges and textures, while a channel-wise 1D Haar DWT treats each pixel's spectrum as a 1D signal to separate low/high-frequency components and limit spectral distortion. The resulting Spectral branch injects wavelet-extracted spatial details into MS features, and the Spatial branch refines PAN features using spectra from the 1D pyramid; the two branches exchange information through Mamba-based cross-modulation that models long-range dependencies with linear complexity. A multi-scale dynamic gate (multiplicative + additive) then adaptively fuses the branch outputs. On WV3, GF2, and QB, S2WMamba matches or surpasses recent strong baselines (FusionMamba, CANNet, U2Net, ARConv), improving PSNR by up to 0.23 dB and reaching an HQNR of 0.956 on full-resolution WV3. Ablations justify the choice of 2D/1D DWT placement, parallel dual branches, and the fusion gate. Our code is available at https://github.com/KagUYa66/S2WMamba.
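The two wavelet decompositions named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `haar_dwt2` and `haar_dwt1_spectral`, the single decomposition level, the orthonormal scaling, and the random inputs are all assumptions made for illustration, and the sketch covers only the transforms, not the Mamba interaction or the fusion gate.

```python
import numpy as np

def haar_dwt2(pan):
    """Single-level 2D Haar DWT of an (H, W) image with even H and W.

    Hypothetical helper: returns one approximation band and three detail
    bands, each of shape (H/2, W/2). Sub-band naming conventions vary.
    """
    a = pan[0::2, 0::2]  # top-left pixel of each 2x2 block
    b = pan[0::2, 1::2]  # top-right
    c = pan[1::2, 0::2]  # bottom-left
    d = pan[1::2, 1::2]  # bottom-right
    ll = (a + b + c + d) / 2.0        # local average (low-pass in both axes)
    row_diff = (a + b - c - d) / 2.0  # change between the two rows
    col_diff = (a - b + c - d) / 2.0  # change between the two columns
    diag = (a - b - c + d) / 2.0      # diagonal detail
    return ll, row_diff, col_diff, diag

def haar_dwt1_spectral(ms):
    """Single-level 1D Haar DWT along the channel axis of a (C, H, W) cube.

    Hypothetical helper: each pixel's spectrum is treated as a 1D signal.
    Assumes an even number of bands C.
    """
    even, odd = ms[0::2], ms[1::2]
    low = (even + odd) / np.sqrt(2.0)   # smooth spectral component
    high = (even - odd) / np.sqrt(2.0)  # band-to-band variation
    return low, high

# Illustrative inputs only (random data, assumed sizes).
rng = np.random.default_rng(0)
pan = rng.random((64, 64))     # stand-in for a PAN patch
ms = rng.random((8, 16, 16))   # stand-in for an 8-band MS patch

ll, row_diff, col_diff, diag = haar_dwt2(pan)
low, high = haar_dwt1_spectral(ms)
print(ll.shape, low.shape)
```

With the orthonormal scaling used here, both transforms preserve signal energy, so the sub-bands together carry the same information as the input, just separated by frequency. A spatially constant PAN patch yields zero in all three detail bands, which is why those bands isolate edges and textures.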
