Robust Monocular Depth Estimation under Challenging Conditions
Stefano Gasperini, Nils Morbitzer, HyunJun Jung, Nassir Navab, Federico Tombari
TL;DR
This paper tackles the problem of robust monocular depth estimation under challenging illumination and weather, where conventional self-supervised and supervised methods falter. It introduces md4all, a simple training-time strategy that leverages image-to-image translations from easy day-like conditions to adverse ones, while computing losses on the original easy samples to preserve reliable signals. The authors present two self-supervised variants—Always Daytime (AD) and Day Distillation (DD)—and extend the approach to supervised learning, showing consistent improvements across nuScenes and Oxford RobotCar datasets in night, rain, and standard conditions, with no inference-time changes to the model. The approach yields substantial performance gains over state-of-the-art baselines, demonstrates strong qualitative improvements, and provides open-source translations to facilitate further research and deployment in safety-critical applications.
Abstract
While state-of-the-art monocular depth estimation approaches achieve impressive results in ideal settings, they are highly unreliable under challenging illumination and weather conditions, such as at nighttime or in the presence of rain. In this paper, we uncover these safety-critical issues and tackle them with md4all: a simple and effective solution that works reliably under both adverse and ideal conditions, as well as for different types of learning supervision. We achieve this by exploiting the efficacy of existing methods under perfect settings. Therefore, we provide valid training signals independently of what is in the input. First, we generate a set of complex samples corresponding to the normal training ones. Then, we train the model by guiding its self- or full-supervision by feeding the generated samples and computing the standard losses on the corresponding original images. Doing so enables a single model to recover information across diverse conditions without modifications at inference time. Extensive experiments on two challenging public datasets, namely nuScenes and Oxford RobotCar, demonstrate the effectiveness of our techniques, outperforming prior works by a large margin in both standard and challenging conditions. Source code and data are available at: https://md4all.github.io.
