RGB-Phase Speckle: Cross-Scene Stereo 3D Reconstruction via Wrapped Pre-Normalization
Kai Yang, Zijian Bai, Yang Xiao, Xinyu Li, Xiaohan Shi
TL;DR
The paper tackles cross-domain robustness in 3D reconstruction by introducing RGB-Phase Speckle, which embeds phase information into RGB speckle projections and employs phase pre-normalization to align inputs across scenes. It proposes a data-centric solution that combines a color-speckle projection pipeline with a normalization step, and evaluates performance on a newly collected RGB speckle dataset in addition to SceneFlow. Key contributions include the RGB color speckle encoding, the phase pre-normalization technique, and a large, challenging speckle dataset with sub-pixel ground truth, demonstrating improved cross-scene generalization for stereo matching networks. The work has practical implications for robust active-binocular 3D sensing in diverse environments, with potential hardware and deployment benefits for real-world measurement tasks.
Abstract
3D reconstruction garners increasing attention alongside the advancement of high-level image applications, where dense stereo matching (DSM) serves as a pivotal technique. Previous studies often rely on publicly available datasets for training, focusing on modifying network architectures or incorporating specialized modules to extract domain-invariant features and thus improve model robustness. In contrast, inspired by single-frame structured-light phase-shifting encoding, this study introduces RGB-Speckle, a cross-scene 3D reconstruction framework based on an active stereo camera system, designed to enhance robustness. Specifically, we propose a novel phase pre-normalization encoding-decoding method: first, we randomly perturb phase-shift maps and embed them into the three RGB channels to generate color speckle patterns; subsequently, the camera captures phase-encoded images modulated by objects as input to a stereo matching network. This technique effectively mitigates external interference and ensures consistent input data for RGB-Speckle, thereby bolstering cross-domain 3D reconstruction stability. To validate the proposed method, we conduct complex experiments: (1) construct a color speckle dataset for complex scenarios based on the proposed encoding scheme; (2) evaluate the impact of the phase pre-normalization encoding-decoding technique on 3D reconstruction accuracy; and (3) further investigate its robustness across diverse conditions. Experimental results demonstrate that the proposed RGB-Speckle model offers significant advantages in cross-domain and cross-scene 3D reconstruction tasks, enhancing model generalization and reinforcing robustness in challenging environments, thus providing a novel solution for robust 3D reconstruction research.
