FinSight-Net:A Physics-Aware Decoupled Network with Frequency-Domain Compensation for Underwater Fish Detection in Smart Aquaculture
Jinsong Yang, Zeyuan Hu, Yichen Li, Hong Yu
TL;DR
FinSight-Net tackles underwater fish detection under challenging optical constraints by introducing physics-aware MS-DDSP bottlenecks and EPA-FPN detail restoration. The approach decouples frequency-domain degradations and reconstitutes high-frequency spatial cues, enabling robust, real-time detection on edge platforms. Across DeepFish, AquaFishSet, and UW-BlurredFish, it achieves state-of-the-art $mAP_{50}$ (e.g., 92.8% on UW-BlurredFish) with substantially fewer parameters, demonstrating strong generalization to turbid and non-rigid aquatic targets. This work provides a practical, efficient solution for smart aquaculture monitoring and marine ecological sensing.
Abstract
Underwater fish detection (UFD) is a core capability for smart aquaculture and marine ecological monitoring. While recent detectors improve accuracy by stacking feature extractors or introducing heavy attention modules, they often incur substantial computational overhead and, more importantly, neglect the physics that fundamentally limits UFD: wavelength-dependent absorption and turbidity-induced scattering significantly degrade contrast, blur fine structures, and introduce backscattering noise, leading to unreliable localization and recognition. To address these challenges, we propose FinSight-Net, an efficient and physics-aware detection framework tailored for complex aquaculture environments. FinSight-Net introduces a Multi-Scale Decoupled Dual-Stream Processing (MS-DDSP) bottleneck that explicitly targets frequency-specific information loss via heterogeneous convolutional branches, suppressing backscattering artifacts while compensating distorted biological cues through scale-aware and channel-weighted pathways. We further design an Efficient Path Aggregation FPN (EPA-FPN) as a detail-filling mechanism: it restores high-frequency spatial information typically attenuated in deep layers by establishing long-range skip connections and pruning redundant fusion routes, enabling robust detection of non-rigid fish targets under severe blur and turbidity. Extensive experiments on DeepFish, AquaFishSet, and our challenging UW-BlurredFish benchmark demonstrate that FinSight-Net achieves state-of-the-art performance. In particular, on UW-BlurredFish, FinSight-Net reaches 92.8% mAP, outperforming YOLOv11s by 4.8% while reducing parameters by 29.0%, providing a strong and lightweight solution for real-time automated monitoring in smart aquaculture.
