GAM-Depth: Self-Supervised Indoor Depth Estimation Leveraging a Gradient-Aware Mask and Semantic Constraints
Anqi Cheng, Zhiyuan Yang, Haiyue Zhu, Kezhi Mao
TL;DR
GAM-Depth tackles indoor self-supervised depth estimation where textureless surfaces weaken photometric supervision and object boundaries cause depth inaccuracies. It introduces a gradient-aware mask that weights the photometric loss $L_{p}$ via $M_{gra}$ depending on gradient magnitude $m$, forming $L_{gra}$, and enforces semantic consistency through a shared encoder with a proxy semantic model to produce $L_{seg}$. The final objective combines $L_{gra}$, $L_{seg}$, and regularizers, yielding state-of-the-art results on NYUv2 and improved generalization to ScanNet and InteriorNet, while producing smoother depths in textureless regions and crisper depth boundaries. This approach has practical implications for indoor robotics and navigation, where reliable depth maps across varied textures are critical.
Abstract
Self-supervised depth estimation has evolved into an image reconstruction task that minimizes a photometric loss. While recent methods have made strides in indoor depth estimation, they often produce inconsistent depth estimation in textureless areas and unsatisfactory depth discrepancies at object boundaries. To address these issues, in this work, we propose GAM-Depth, developed upon two novel components: gradient-aware mask and semantic constraints. The gradient-aware mask enables adaptive and robust supervision for both key areas and textureless regions by allocating weights based on gradient magnitudes.The incorporation of semantic constraints for indoor self-supervised depth estimation improves depth discrepancies at object boundaries, leveraging a co-optimization network and proxy semantic labels derived from a pretrained segmentation model. Experimental studies on three indoor datasets, including NYUv2, ScanNet, and InteriorNet, show that GAM-Depth outperforms existing methods and achieves state-of-the-art performance, signifying a meaningful step forward in indoor depth estimation. Our code will be available at https://github.com/AnqiCheng1234/GAM-Depth.
