HIDFlowNet: A Flow-Based Deep Network for Hyperspectral Image Denoising
Qizhou Wang, Li Pang, Xiangyong Cao, Zhiqiang Tian, Deyu Meng
TL;DR
The paper tackles the ill-posed nature of hyperspectral image denoising by learning the conditional distribution $P(X|Y)$ instead of a single deterministic mapping. It introduces HIDFlowNet, a flow-based architecture with a non-invertible conditional encoder to capture global low-frequency information and an invertible decoder to generate local high-frequency details, enabling diverse clean HSI samples by drawing $z\sim p_z$ and applying inverse transforms. The model optimizes a negative log-likelihood term plus a reconstruction loss, leveraging a diagonal Jacobian for efficient training, and demonstrates robust performance on synthetic and real datasets with stable, diverse restorations. This approach provides a principled way to handle denoising as sampling from a conditional distribution, improving detail preservation while offering multiple valid clean reconstructions for the same noisy input.
Abstract
Hyperspectral image (HSI) denoising is essentially ill-posed since a noisy HSI can be degraded from multiple clean HSIs. However, existing deep learning (DL)-based approaches only restore one clean HSI from the given noisy HSI with a deterministic mapping, thus ignoring the ill-posed issue and always resulting in an over-smoothing problem. Additionally, these DL-based methods often neglect that noise is part of the high-frequency component and their network architectures fail to decouple the learning of low-frequency and high-frequency. To alleviate these issues, this paper proposes a flow-based HSI denoising network (HIDFlowNet) to directly learn the conditional distribution of the clean HSI given the noisy HSI and thus diverse clean HSIs can be sampled from the conditional distribution. Overall, our HIDFlowNet is induced from the generative flow model and is comprised of an invertible decoder and a conditional encoder, which can explicitly decouple the learning of low-frequency and high-frequency information of HSI. Specifically, the invertible decoder is built by staking a succession of invertible conditional blocks (ICBs) to capture the local high-frequency details. The conditional encoder utilizes down-sampling operations to obtain low-resolution images and uses transformers to capture correlations over a long distance so that global low-frequency information can be effectively extracted. Extensive experiments on simulated and real HSI datasets verify that our proposed HIDFlowNet can obtain better or comparable results compared with other state-of-the-art methods.
