Wavelet-Assisted Multi-Frequency Attention Network for Pansharpening

Jie Huang; Rui Huang; Jinghao Xu; Siran Pen; Yule Duan; Liangjian Deng

Wavelet-Assisted Multi-Frequency Attention Network for Pansharpening

Jie Huang, Rui Huang, Jinghao Xu, Siran Pen, Yule Duan, Liangjian Deng

TL;DR

This work tackles pansharpening by introducing a frequency-domain fusion strategy that preserves spectral and spatial details. It proposes WFANet, which combines Multi-Frequency Fusion Attention (MFFA) with a Spatial Detail Enhancement Module (SDEM) in a wavelet pyramid to fuse PAN and LRMS features across multiple scales, using a Frequency Attention Triplet with Frequency-Query, Spatial-Key, and Fusion-Value and lossless reconstruction via IDWT. The approach achieves state-of-the-art results on WV3, GF2, and QB datasets in both reduced- and full-resolution settings, with ablations validating the importance of each component, including the DWT-based frequency separation, the attention design, the FAB-based frequency adaptation, and the multi-scale training strategy. The work offers a practical, generalizable framework for high-quality pansharpening with strong potential for real-world remote sensing applications due to its robust frequency-aware fusion and progressive reconstruction capabilities.

Abstract

Pansharpening aims to combine a high-resolution panchromatic (PAN) image with a low-resolution multispectral (LRMS) image to produce a high-resolution multispectral (HRMS) image. Although pansharpening in the frequency domain offers clear advantages, most existing methods either continue to operate solely in the spatial domain or fail to fully exploit the benefits of the frequency domain. To address this issue, we innovatively propose Multi-Frequency Fusion Attention (MFFA), which leverages wavelet transforms to cleanly separate frequencies and enable lossless reconstruction across different frequency domains. Then, we generate Frequency-Query, Spatial-Key, and Fusion-Value based on the physical meanings represented by different features, which enables a more effective capture of specific information in the frequency domain. Additionally, we focus on the preservation of frequency features across different operations. On a broader level, our network employs a wavelet pyramid to progressively fuse information across multiple scales. Compared to previous frequency domain approaches, our network better prevents confusion and loss of different frequency features during the fusion process. Quantitative and qualitative experiments on multiple datasets demonstrate that our method outperforms existing approaches and shows significant generalization capabilities for real-world scenarios.

Wavelet-Assisted Multi-Frequency Attention Network for Pansharpening

TL;DR

Abstract

Wavelet-Assisted Multi-Frequency Attention Network for Pansharpening

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (13)