Underwater Variable Zoom: Depth-Guided Perception Network for Underwater Image Enhancement

Zhixiong Huang; Xinying Wang; Chengpei Xu; Jinjiang Li; Lin Feng

Underwater Variable Zoom: Depth-Guided Perception Network for Underwater Image Enhancement

Zhixiong Huang, Xinying Wang, Chengpei Xu, Jinjiang Li, Lin Feng

TL;DR

A novel depth-guided perception UIE framework, dubbed underwater variable zoom (UVZ), which parses near-far scenarios by harnessing the predicted depth maps, enabling local and non-local perceiving in different regions.

Abstract

Underwater scenes intrinsically involve degradation problems owing to heterogeneous ocean elements. Prevailing underwater image enhancement (UIE) methods stick to straightforward feature modeling to learn the mapping function, which leads to limited vision gain as it lacks more explicit physical cues (e.g., depth). In this work, we investigate injecting the depth prior into the deep UIE model for more precise scene enhancement capability. To this end, we present a novel depth-guided perception UIE framework, dubbed underwater variable zoom (UVZ). Specifically, UVZ resorts to a two-stage pipeline. First, a depth estimation network is designed to generate critical depth maps, combined with an auxiliary supervision network introduced to suppress estimation differences during training. Second, UVZ parses near-far scenarios by harnessing the predicted depth maps, enabling local and non-local perceiving in different regions. Extensive experiments on five benchmark datasets demonstrate that UVZ achieves superior visual gain and delivers promising quantitative metrics. Besides, UVZ is confirmed to exhibit good generalization in some visual tasks, especially in unusual lighting conditions. The code, models and results are available at: https://github.com/WindySprint/UVZ.

Underwater Variable Zoom: Depth-Guided Perception Network for Underwater Image Enhancement

TL;DR

Abstract

Paper Structure (17 sections, 9 equations, 17 figures, 4 tables)

This paper contains 17 sections, 9 equations, 17 figures, 4 tables.

Introduction
Related work
Traditional Methods
Deep Learning Methods
Depth-related Methods
Methodology
Overall Architecture
Depth Perception Module
Dual-Attention Module
Training Strategy
Experiment
Implementation Details
Comparison on reference dataset
Comparison on no-reference dataset
Ablation Study
...and 2 more sections

Figures (17)

Figure 1: Near-far scenarios enhancement comparison by different methods, where the orange box represents the enlarged near scene and the red box represents the enlarged far scene. The top row shows the raw image, the results of L$^2$UWE 5, CBM 6, TEBCF 7, WWPE 8, and Water-Net 9. The bottom row shows the results of SCNet 10, TACL 11, USUIR 12, PUGAN 13, Spectroformer 14, and our UVZ.
Figure 2: Motivation of our proposed depth-guided perception. The degradation degree varies significantly between the near scene and far scene. In comparison, the visual quality of the former and the adjacent regions exhibits relatively better performance. For the two scenes, we should implement different feature learning strategies.
Figure 3: The two-stage framework of UVZ, including DEN, DGEN, and ASN. All sub-networks adopt a standard encoder/decoder architecture, where the red slash indicates that ASN is only used for the training. In the first stage, for a raw image $X$, DEN and ASN generate the depth map $d$ and the regression image $\widehat{X}$, respectively. For the second stage, with inputs $X$ and $d$, DGEN generates the enhanced image $Y$.
Figure 4: R$^3$S transformation of a depth map. The input is the depth map d and the outputs are the feature maps $d_{rev}$, $d_1$, $d_3$, and $d_5$. The process within the dashed line is a 3 × 3 region smoothing (RS).
Figure 5: Structure of the Dual-Attention Module (DAM)
...and 12 more figures

Underwater Variable Zoom: Depth-Guided Perception Network for Underwater Image Enhancement

TL;DR

Abstract

Underwater Variable Zoom: Depth-Guided Perception Network for Underwater Image Enhancement

Authors

TL;DR

Abstract

Table of Contents

Figures (17)