Phaseformer: Phase-based Attention Mechanism for Underwater Image Restoration and Beyond
MD Raqib Khan, Anshul Negi, Ashutosh Kulkarni, Shruti S. Phutke, Santosh Kumar Vipparthi, Subrahmanyam Murala
TL;DR
Phaseformer addresses underwater image restoration by leveraging a phase-based transformer that emphasizes structure via phase information. It introduces a Phase-based Transformer Block with phase-derived queries/keys and an Optimized Phase Attention Block to propagate encoder features through phase-aware skip connections, coupled with learnable loss weights to balance multiple objectives. The approach achieves strong performance on synthetic and real underwater datasets while remaining lightweight (approximately $1.77\times 10^6$ parameters and $\sim13.0\times 10^9$ FLOPs), and also demonstrates applicability to low-light enhancement and downstream vision tasks. This combination of efficiency, phase-informed attention, and adaptive optimization yields practical benefits for autonomous underwater systems and related image enhancement domains.
Abstract
Quality degradation is observed in underwater images due to the effects of light refraction and absorption by water, leading to issues like color cast, haziness, and limited visibility. This degradation negatively affects the performance of autonomous underwater vehicles used in marine applications. To address these challenges, we propose a lightweight phase-based transformer network with 1.77M parameters for underwater image restoration (UIR). Our approach focuses on effectively extracting non-contaminated features using a phase-based self-attention mechanism. We also introduce an optimized phase attention block to restore structural information by propagating prominent attentive features from the input. We evaluate our method on both synthetic (UIEB, UFO-120) and real-world (UIEB, U45, UCCS, SQUID) underwater image datasets. Additionally, we demonstrate its effectiveness for low-light image enhancement using the LOL dataset. Through extensive ablation studies and comparative analysis, it is clear that the proposed approach outperforms existing state-of-the-art (SOTA) methods.
