Glossy Object Reconstruction with Cost-effective Polarized Acquisition
Bojian Wu, Yifan Peng, Ruizhen Hu, Xiaowei Zhou
TL;DR
The paper tackles glossy object reconstruction by leveraging a cost-effective acquisition setup that captures multi-view polarization images with an unknown polarizer angle. It introduces a neural implicit surface framework (VolSDF-style) with decomposed diffuse and specular radiance nets, guided by a differentiable polarization rendering layer that uses the polarimetric BRDF to estimate the outgoing Stokes vector $\mathbf{s}^\text{out}$ and render at $\phi_\text{pol}$. Optimization of the loss $\mathcal{L} = \mathcal{L}_\text{rgb} + \mathcal{L}_\text{mask} + 0.1\mathcal{L}_\text{eikonal}$ yields high-fidelity geometry and radiance decomposition, demonstrated on public and real-world datasets with robustness to polarizer-angle variations. The approach reduces hardware costs and enables accurate reconstruction and novel-view synthesis, with potential applications on consumer devices such as smartphones and IoTs.
Abstract
The challenge of image-based 3D reconstruction for glossy objects lies in separating diffuse and specular components on glossy surfaces from captured images, a task complicated by the ambiguity in discerning lighting conditions and material properties using RGB data alone. While state-of-the-art methods rely on tailored and/or high-end equipment for data acquisition, which can be cumbersome and time-consuming, this work introduces a scalable polarization-aided approach that employs cost-effective acquisition tools. By attaching a linear polarizer to readily available RGB cameras, multi-view polarization images can be captured without the need for advance calibration or precise measurements of the polarizer angle, substantially reducing system construction costs. The proposed approach represents polarimetric BRDF, Stokes vectors, and polarization states of object surfaces as neural implicit fields. These fields, combined with the polarizer angle, are retrieved by optimizing the rendering loss of input polarized images. By leveraging fundamental physical principles for the implicit representation of polarization rendering, our method demonstrates superiority over existing techniques through experiments in public datasets and real captured images on both reconstruction and novel view synthesis.
