SteeredMarigold: Steering Diffusion Towards Depth Completion of Largely Incomplete Depth Maps
Jakub Gregorek, Lazaros Nalpantidis
TL;DR
SteeredMarigold tackles depth completion under extreme depth sparsity by conditioning a pre-trained diffusion-based monocular depth estimator (Marigold) with sparse, metric depth points. It performs diffusion in a latent VAE space and introduces a plug-and-play steering mechanism that nudges the diffusion process toward known depth constraints through linear interpolations and a steering factor $\lambda$, without any training. The approach yields metrically consistent depth in largely incomplete maps, achieving state-of-the-art results on NYUv2 in such scenarios and exhibiting robustness to missing-depth regions. While highly effective in incomplete-sparsity settings, it trades off real-time performance for accuracy and remains to be validated across more datasets and modalities.
Abstract
Even if the depth maps captured by RGB-D sensors deployed in real environments are often characterized by large areas missing valid depth measurements, the vast majority of depth completion methods still assumes depth values covering all areas of the scene. To address this limitation, we introduce SteeredMarigold, a training-free, zero-shot depth completion method capable of producing metric dense depth, even for largely incomplete depth maps. SteeredMarigold achieves this by using the available sparse depth points as conditions to steer a denoising diffusion probabilistic model. Our method outperforms relevant top-performing methods on the NYUv2 dataset, in tests where no depth was provided for a large area, achieving state-of-art performance and exhibiting remarkable robustness against depth map incompleteness. Our source code is publicly available at https://steeredmarigold.github.io.
