WaterWave: Bridging Underwater Image Enhancement into Video Streams via Wavelet-based Temporal Consistency Field
Qi Zhu, Jingyi Zhang, Naishan Zheng, Wei Yu, Jinghao Zhang, Deyi Ji, Feng Zhao
TL;DR
The paper addresses the challenge of enhancing underwater videos without paired data and with temporal inconsistency from frame-wise UWIE methods. It introduces WaterWave, an implicit neural representation framework that enforces temporal coherence through a wavelet-based temporal field, aided by Transmission-guided Flow Rectification and a Video Consistency-aware Wavelet block. By leveraging 3D multi-scale hash encoding and wavelet lifting, WaterWave decouples temporal inconsistency from content, achieving higher fidelity and smoother video, with demonstrated gains in downstream underwater tracking. The approach shows strong potential for improving video-level quality in marine applications and provides a pathway for training in data-scarce, real-world underwater scenarios.
Abstract
Underwater video pairs are fairly difficult to obtain due to the complex underwater imaging. In this case, most existing video underwater enhancement methods are performed by directly applying the single-image enhancement model frame by frame, but a natural issue is lacking temporal consistency. To relieve the problem, we rethink the temporal manifold inherent in natural videos and observe a temporal consistency prior in dynamic scenes from the local temporal frequency perspective. Building upon the specific prior and no paired-data condition, we propose an implicit representation manner for enhanced video signals, which is conducted in the wavelet-based temporal consistency field, WaterWave. Specifically, under the constraints of the prior, we progressively filter and attenuate the inconsistent components while preserving motion details and scenes, achieving a natural-flowing video. Furthermore, to represent temporal frequency bands more accurately, an underwater flow correction module is designed to rectify estimated flows considering the transmission in underwater scenes. Extensive experiments demonstrate that WaterWave significantly enhances the quality of videos generated using single-image underwater enhancements. Additionally, our method demonstrates high potential in downstream underwater tracking tasks, such as UOSTrack and MAT, outperforming the original video by a large margin, i.e., 19.7% and 9.7% on precise respectively.
