Towards 3D Vision with Low-Cost Single-Photon Cameras
Fangzhou Mu, Carter Sifferman, Sacha Jungerman, Yiquan Li, Mark Han, Michael Gleicher, Mohit Gupta, Yin Li
TL;DR
This work introduces a low-cost, time-resolved 3D vision system based on distributed SPAD proximity sensors to reconstruct complex Lambertian objects. It combines a differentiable transient formation model with a neural implicit surface (SDF) representation, rendering transients via volume rendering and optimizing against observed histograms in an analysis-by-synthesis framework. Across simulations and real hardware, the method substantially outperforms reprojection and space carving baselines, achieving Chamfer distances on the order of a few millimeters and demonstrating robustness to ambient light and texture. The approach paves the way for practical, energy-efficient 3D sensing with commodity hardware in robotics, wearables, and mobile platforms, while outlining future directions for handling non-Lambertian reflectance and real-time operation.
Abstract
We present a method for reconstructing 3D shape of arbitrary Lambertian objects based on measurements by miniature, energy-efficient, low-cost single-photon cameras. These cameras, operating as time resolved image sensors, illuminate the scene with a very fast pulse of diffuse light and record the shape of that pulse as it returns back from the scene at a high temporal resolution. We propose to model this image formation process, account for its non-idealities, and adapt neural rendering to reconstruct 3D geometry from a set of spatially distributed sensors with known poses. We show that our approach can successfully recover complex 3D shapes from simulated data. We further demonstrate 3D object reconstruction from real-world captures, utilizing measurements from a commodity proximity sensor. Our work draws a connection between image-based modeling and active range scanning and is a step towards 3D vision with single-photon cameras.
