Masked Spatial Propagation Network for Sparsity-Adaptive Depth Refinement
Jinyoung Jun, Jae-Han Lee, Chang-Su Kim
TL;DR
This work tackles depth completion under varying sparsity by introducing sparsity-adaptive depth refinement (SDR) and a masked spatial propagation network (MSPN). The SDR framework combines an off-the-shelf monocular depth estimator, a guidance network, and MSPN to iteratively refine depth maps while updating a propagation mask, enabling robust performance when the number of sparse depth points changes. MSPN uses a masked attention mechanism over pixel neighborhoods and adaptive iteration counts to propagate information from sparse measurements to the full image, achieving state-of-the-art results on both SDR and traditional depth completion benchmarks (NYUv2 and KITTI). The approach generalizes across different monocular estimators and datasets, and provides practical robustness for real-world depth sensing where sparsity is often variable and unpredictable.
Abstract
The main function of depth completion is to compensate for an insufficient and unpredictable number of sparse depth measurements of hardware sensors. However, existing research on depth completion assumes that the sparsity -- the number of points or LiDAR lines -- is fixed for training and testing. Hence, the completion performance drops severely when the number of sparse depths changes significantly. To address this issue, we propose the sparsity-adaptive depth refinement (SDR) framework, which refines monocular depth estimates using sparse depth points. For SDR, we propose the masked spatial propagation network (MSPN) to perform SDR with a varying number of sparse depths effectively by gradually propagating sparse depth information throughout the entire depth map. Experimental results demonstrate that MPSN achieves state-of-the-art performance on both SDR and conventional depth completion scenarios.
