MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection
Youngmin Oh, Hyung-Il Kim, Seong Tae Kim, Jung Uk Kim
TL;DR
Monocular 3D object detection often fails under adverse weather like fog. The authors propose MonoWAD, a weather-robust detector that combines a weather codebook to memorize clear-weather knowledge and generate weather-reference features with a weather-adaptive diffusion model that uses the fog distribution $\mathcal{F}=x^f-x^c$ to progressively enhance feature representations. Two losses, $\mathcal{L}_{ckr}$ and $\mathcal{L}_{wae}$, guide the codebook and diffusion model, and standard detection loss $\mathcal{L}_{OD}$ completes the end-to-end objective $\mathcal{L}_{Total}$. Experiments on KITTI, Foggy KITTI, and Virtual KITTI show improved weather robustness, outperforming state-of-the-art monocular detectors under foggy and mixed conditions, with code and data released for reproducibility. This approach offers a practical path toward reliable perception in real-world autonomous systems across diverse weather scenarios.
Abstract
Monocular 3D object detection is an important challenging task in autonomous driving. Existing methods mainly focus on performing 3D detection in ideal weather conditions, characterized by scenarios with clear and optimal visibility. However, the challenge of autonomous driving requires the ability to handle changes in weather conditions, such as foggy weather, not just clear weather. We introduce MonoWAD, a novel weather-robust monocular 3D object detector with a weather-adaptive diffusion model. It contains two components: (1) the weather codebook to memorize the knowledge of the clear weather and generate a weather-reference feature for any input, and (2) the weather-adaptive diffusion model to enhance the feature representation of the input feature by incorporating a weather-reference feature. This serves an attention role in indicating how much improvement is needed for the input feature according to the weather conditions. To achieve this goal, we introduce a weather-adaptive enhancement loss to enhance the feature representation under both clear and foggy weather conditions. Extensive experiments under various weather conditions demonstrate that MonoWAD achieves weather-robust monocular 3D object detection. The code and dataset are released at https://github.com/VisualAIKHU/MonoWAD.
