Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation
Amir El-Ghoussani, Julia Hornauer, Gustavo Carneiro, Vasileios Belagiannis
TL;DR
This work tackles unsupervised domain adaptation for monocular depth estimation by reframing it as a consistency-based semi-supervised problem that only requires source-domain labels. It introduces a single-model approach combining a pairwise source loss with multi-view perturbation consistency on unlabelled target data, inspired by FixMatch but adapted for continuous depth regression. The method employs RandAugment-based target perturbations and a CutMix-based pretraining stage, resulting in a total loss that balances supervised and unsupervised objectives via a batch-ratio parameter. Across outdoor (virtual KITTI to KITTI) and indoor (SceneNet to NYUv2) benchmarks, the approach achieves state-of-the-art domain-adaptation performance and is validated through thorough ablations. The proposed framework offers a simple, effective alternative to complex multi-model pipelines, with practical impact for deploying depth estimation systems across diverse environments.
Abstract
In monocular depth estimation, unsupervised domain adaptation has recently been explored to relax the dependence on large annotated image-based depth datasets. However, this comes at the cost of training multiple models or requiring complex training protocols. We formulate unsupervised domain adaptation for monocular depth estimation as a consistency-based semi-supervised learning problem by assuming access only to the source domain ground truth labels. To this end, we introduce a pairwise loss function that regularises predictions on the source domain while enforcing perturbation consistency across multiple augmented views of the unlabelled target samples. Importantly, our approach is simple and effective, requiring only training of a single model in contrast to the prior work. In our experiments, we rely on the standard depth estimation benchmarks KITTI and NYUv2 to demonstrate state-of-the-art results compared to related approaches. Furthermore, we analyse the simplicity and effectiveness of our approach in a series of ablation studies. The code is available at \url{https://github.com/AmirMaEl/SemiSupMDE}.
