HazyDet: Open-Source Benchmark for Drone-View Object Detection with Depth-Cues in Hazy Scenes
Changfeng Feng, Zhenyuan Chen, Xiang Li, Chunping Wang, Jian Yang, Ming-Ming Cheng, Yimian Dai, Qiang Fu
TL;DR
The paper tackles the challenge of drone-view object detection under hazy conditions by introducing HazyDet, a large-scale benchmark with 383k real-world and synthetically hazy instances, designed to reflect aerial perspectives and depth-related degradation. It proposes DeCoDet, a Depth-Conditioned Detector that leverages Depth-Conditioned Kernels and multi-scale depth priors to modulate feature representations based on depth cues, avoiding explicit dehazing. Training employs Progressive Domain Fine-Tuning (PDFT) to bridge synthetic-to-real domain gaps and a Scale-Invariant Refurbishment Loss (SIRLoss) to robustly learn from noisy depth annotations, yielding state-of-the-art performance with a +1.5% mAP improvement on challenging real hazy data. The dataset and toolkit are open-sourced, enabling principled evaluation and reproducible development for robust UAV perception in adverse weather. The approach emphasizes depth as a core modality for haze resilience, linking atmospheric scattering to object-scale and visibility in aerial imagery through the ASM formulation $I(x,y)=J(x,y)t(x,y)+A(1-t(x,y))$, $t(x,y)=e^{-eta d(x,y)}$.
Abstract
Object detection from aerial platforms under adverse atmospheric conditions, particularly haze, is paramount for robust drone autonomy. Yet, this domain remains largely underexplored, primarily hindered by the absence of specialized benchmarks. To bridge this gap, we present \textit{HazyDet}, the first, large-scale benchmark specifically designed for drone-view object detection in hazy conditions. Comprising 383,000 real-world instances derived from both naturally hazy captures and synthetically hazed scenes augmented from clear images, HazyDet provides a challenging and realistic testbed for advancing detection algorithms. To address the severe visual degradation induced by haze, we propose the Depth-Conditioned Detector (DeCoDet), a novel architecture that integrates a Depth-Conditioned Kernel to dynamically modulate feature representations based on depth cues. The practical efficacy and robustness of DeCoDet are further enhanced by its training with a Progressive Domain Fine-Tuning (PDFT) strategy to navigate synthetic-to-real domain shifts, and a Scale-Invariant Refurbishment Loss (SIRLoss) to ensure resilient learning from potentially noisy depth annotations. Comprehensive empirical validation on HazyDet substantiates the superiority of our unified DeCoDet framework, which achieves state-of-the-art performance, surpassing the closest competitor by a notable +1.5\% mAP on challenging real-world hazy test scenarios. Our dataset and toolkit are available at https://github.com/GrokCV/HazyDet.
