CPDM: Content-Preserving Diffusion Model for Underwater Image Enhancement

Xiaowen Shi; Yuan-Gen Wang

CPDM: Content-Preserving Diffusion Model for Underwater Image Enhancement

Xiaowen Shi, Yuan-Gen Wang

TL;DR

CPDM first leverages a diffusion model as its fundamental model for stable training and then designs a content-preserving framework to deal with changes in imaging conditions, and outperforms state-of-the-art methods in both subjective and objective metrics.

Abstract

Underwater image enhancement (UIE) is challenging since image degradation in aquatic environments is complicated and changing over time. Existing mainstream methods rely on either physical-model or data-driven, suffering from performance bottlenecks due to changes in imaging conditions or training instability. In this article, we make the first attempt to adapt the diffusion model to the UIE task and propose a Content-Preserving Diffusion Model (CPDM) to address the above challenges. CPDM first leverages a diffusion model as its fundamental model for stable training and then designs a content-preserving framework to deal with changes in imaging conditions. Specifically, we construct a conditional input module by adopting both the raw image and the difference between the raw and noisy images as the input, which can enhance the model's adaptability by considering the changes involving the raw images in underwater environments. To preserve the essential content of the raw images, we construct a content compensation module for content-aware training by extracting low-level features from the raw images. Extensive experimental results validate the effectiveness of our CPDM, surpassing the state-of-the-art methods in terms of both subjective and objective metrics.

CPDM: Content-Preserving Diffusion Model for Underwater Image Enhancement

TL;DR

Abstract

Paper Structure (13 sections, 12 equations, 6 figures, 2 tables, 2 algorithms)

This paper contains 13 sections, 12 equations, 6 figures, 2 tables, 2 algorithms.

Introduction
Related Work
Underwater Image Enhancement Methods
Diffusion Model for Image Generation
Proposed method
Conditional Input Module
Content Compensation Module
Experiments
Dataset
Experimental Setting and Metrics
Results
Ablation Study
Conclusion

Figures (6)

Figure 1: A visual illustration of the proposed CPDM method. (a) Original images to be enhanced. (b) Images enhanced by our CPDM. (c) Reference images as ground truth.
Figure 2: Illustration of our conditional input module. The forward diffusion process denotes $q$ (from right to left), and the backward inference process denotes $p_{\theta_{cpdm}}$ (from left to right). $x_0$ and $y_0$ denote the clear and paired underwater images, respectively.
Figure 3: Illustration of the proposed CPDM at time step $t$. Here, $y_0$ represents the to-be-enhanced underwater image, and $x_t$ denotes the noisy image of the current time step.
Figure 4: Visual comparison of enhanced underwater images by various methods on the Test_L400 (LSUI) dataset. From left to right: original underwater image, results from WaterNet li2019underwater, FUnIE 9001231funie, Ucolor 9426457ucolor, U-shape Transformer peng2023u, our CPDM method, and the reference image.
Figure 5: Visual comparison of enhanced underwater images by various methods on the Test_U90 (UIEB) dataset. From left to right: original underwater image, results from WaterNet li2019underwater, FUnIE 9001231funie, Ucolor 9426457ucolor, U-shape Transformer peng2023u, our CPDM method, and the reference image.
...and 1 more figures

CPDM: Content-Preserving Diffusion Model for Underwater Image Enhancement

TL;DR

Abstract

CPDM: Content-Preserving Diffusion Model for Underwater Image Enhancement

Authors

TL;DR

Abstract

Table of Contents

Figures (6)