Single-Photon 3D Imaging with Equi-Depth Photon Histograms
Kaustubh Sadekar, David Maier, Atul Ingle
TL;DR
This work tackles bandwidth and memory bottlenecks in SPAD-based 3D imaging by replacing conventional equi-width histograms with equi-depth histograms (EDH) and online, count-free bin-boundary estimation. The proposed PEDH pipeline compresses photon data while preserving peak information, and couples it with a learning-based DeePEDH distance estimator to produce high-quality depth maps under challenging lighting. The approach yields substantial reductions in data bandwidth and in-pixel memory requirements, while enabling robust downstream tasks such as RGBD visual odometry, dense 3D reconstruction, and RGBD semantic segmentation on resource-constrained platforms. This enables practical, high-resolution 3D perception for mobile devices and AR/VR sensing, with demonstrated gains over EW-based SPCs in both simulation and hardware-emulation scenarios.
Abstract
Single-photon cameras present a promising avenue for high-resolution 3D imaging. They have ultra-high sensitivity -- down to individual photons -- and can record photon arrival times with extremely high (sub-nanosecond) resolution. Single-photon 3D cameras estimate the round-trip time of a laser pulse by forming equi-width (EW) histograms of detected photon timestamps. Acquiring and transferring such EW histograms requires high bandwidth and in-pixel memory, making SPCs less attractive in resource-constrained settings such as mobile devices and AR/VR headsets. In this work we propose a 3D sensing technique based on equi-depth (ED) histograms. ED histograms compress timestamp data more efficiently than EW histograms, reducing the bandwidth requirement. Moreover, to reduce the in-pixel memory requirement, we propose a lightweight algorithm to estimate ED histograms in an online fashion without explicitly storing the photon timestamps. This algorithm is amenable to future in-pixel implementations. We propose algorithms that process ED histograms to perform 3D computer-vision tasks of estimating scene distance maps and performing visual odometry under challenging conditions such as high ambient light. Our work paves the way towards lower bandwidth and reduced in-pixel memory requirements for SPCs, making them attractive for resource-constrained 3D vision applications. Project page: $\href{https://www.computational.camera/pedh}{https://www.computational.camera/pedh}$
