Mobile Robotic Multi-View Photometric Stereo
Suryansh Kumar
TL;DR
The paper addresses the limitation of traditional MVPS requiring fixed laboratory setups by introducing a portable mobile robotic MVPS system. It proposes an online incremental pipeline that jointly predicts light directions and intensities, per-pixel surface normals with uncertainty, and per-view depth priors, then refines depth via an uncertainty-guided optimization and fuses frames with online TSDF fusion. On the DiLiGenT-MV benchmark, using only 36 viewpoints with 8 PS images per view (total 288 images), the method achieves competitive geometry with state-of-the-art methods that use far more data and demonstrates over $100\times$ computational efficiency. This work enables automated, fine-grained 3D photogrammetry for mobile robotics, reducing calibration burdens and enabling flexible data acquisition in unconstrained settings.
Abstract
Multi-View Photometric Stereo (MVPS) is a popular method for fine-detailed 3D acquisition of an object from images. Despite its outstanding results on diverse material objects, a typical MVPS experimental setup requires a well-calibrated light source and a monocular camera installed on an immovable base. This restricts the use of MVPS on a movable platform, limiting us from taking MVPS benefits in 3D acquisition for mobile robotics applications. To this end, we introduce a new mobile robotic system for MVPS. While the proposed system brings advantages, it introduces additional algorithmic challenges. Addressing them, in this paper, we further propose an incremental approach for mobile robotic MVPS. Our approach leverages a supervised learning setup to predict per-view surface normal, object depth, and per-pixel uncertainty in model-predicted results. A refined depth map per view is obtained by solving an MVPS-driven optimization problem proposed in this paper. Later, we fuse the refined depth map while tracking the camera pose w.r.t the reference frame to recover globally consistent object 3D geometry. Experimental results show the advantages of our robotic system and algorithm, featuring the local high-frequency surface detail recovery with globally consistent object shape. Our work is beyond any MVPS system yet presented, providing encouraging results on objects with unknown reflectance properties using fewer frames without a tiring calibration and installation process, enabling computationally efficient robotic automation approach to photogrammetry. The proposed approach is nearly 100 times computationally faster than the state-of-the-art MVPS methods such as [1, 2] while maintaining the similar results when tested on subjects taken from the benchmark DiLiGenT MV dataset [3].
