Indoor 3D Reconstruction with an Unknown Camera-Projector Pair
Zhaoshuai Qi, Yifeng Hao, Rui Hu, Wenyou Chang, Jiaqi Yang, Yanning Zhang
TL;DR
This work tackles indoor 3D reconstruction with an unknown camera-projector pair by introducing a cuboid corner (C2) as a robust, common cue to derive sufficient two-view constraints for CPP self-calibration. The authors derive single- and two-view geometry around C2, showing that the CPP calibration can be reduced to a univariate problem in the camera focal length $f_c$, via a deterministic mapping to the projector intrinsics $K_p=g(K_c)$. A cycle-consistent optimization objective drives accurate intrinsic estimation, enabling dense 3D reconstructions that rival multi-view and learning-based methods while requiring only two views. The approach extends naturally to two-view SfM and highlights the importance of the cycle term in calibration, though it remains limited by degenerate C2 configurations and the need for manual segmentation of faces. Overall, the method offers a practical pathway for reliable indoor SL with unknown CPP and demonstrates potential for sparse-view camera self-calibration.
Abstract
Structured light-based method with a camera-projector pair (CPP) plays a vital role in indoor 3D reconstruction, especially for scenes with weak textures. Previous methods usually assume known intrinsics, which are pre-calibrated from known objects, or self-calibrated from multi-view observations. It is still challenging to reliably recover CPP intrinsics from only two views without any known objects. In this paper, we provide a simple yet reliable solution. We demonstrate that, for the first time, sufficient constraints on CPP intrinsics can be derived from an unknown cuboid corner (C2), e.g. a room's corner, which is a common structure in indoor scenes. In addition, with only known camera principal point, the complex multi-variable estimation of all CPP intrinsics can be simplified to a simple univariable optimization problem, leading to reliable calibration and thus direct 3D reconstruction with unknown CPP. Extensive results have demonstrated the superiority of the proposed method over both traditional and learning-based counterparts. Furthermore, the proposed method also demonstrates impressive potential to solve similar tasks without active lighting, such as sparse-view structure from motion.
