Towards Robust Time-of-Flight Depth Denoising with Confidence-Aware Diffusion Model
Changyong He, Jin Zeng, Jiawei Zhang, Jiajie Guo
TL;DR
To address severe ToF depth noise, the paper introduces DepthCAD, a confidence-aware diffusion framework that denoises raw ToF correlation measurements rather than depth maps. It adapts pretrained Stable Diffusion 2.1 to the ToF domain by applying diffusion to normalized raw correlations with a dynamic range normalization to bridge domain gaps, and it injects a gradient-based confidence map to balance global structure with local fidelity. The two-component architecture—Raw Data Diffusion Module and Confidence Guidance Module—together achieve global structural smoothness while preserving metric accuracy, outperforming state-of-the-art methods on synthetic FLAT data and real Kinect v2 measurements. The approach demonstrates strong generalization to real-world ToF noise and provides a practical, robust solution for high-quality ToF depth in challenging lighting and distance conditions, with code available for reproducibility.
Abstract
Time-of-Flight (ToF) sensors efficiently capture scene depth, but the nonlinear depth construction procedure often results in extremely large noise variance or even invalid areas. Recent methods based on deep neural networks (DNNs) achieve enhanced ToF denoising accuracy but tend to struggle when presented with severe noise corruption due to limited prior knowledge of ToF data distribution. In this paper, we propose DepthCAD, a novel ToF denoising approach that ensures global structural smoothness by leveraging the rich prior knowledge in Stable Diffusion and maintains local metric accuracy by steering the diffusion process with confidence guidance. To adopt the pretrained image diffusion model to ToF depth denoising, we apply the diffusion on raw ToF correlation measurements with dynamic range normalization before converting to depth maps. Experimental results validate the state-of-the-art performance of the proposed scheme, and the evaluation on real data further verifies its robustness against real-world ToF noise.
