Physics-Based Decomposition of Reflectance and Shading using a Single Visible-Thermal Image Pair
Zeqing Leo Yuan, Mani Ramanagopal, Aswin C. Sankaranarayanan, Srinivasa G. Narasimhan
TL;DR
This work addresses intrinsic image decomposition (IID) from a single visible image paired with a co-registered thermal image. It develops a physics-based framework that uses heat produced by absorbed light to establish ordinal relations between visible and thermal intensities, linking them to albedo $\rho$ and shading $\eta$ through local and non-local constraints. A Double-DIP (DDIP) parameterization with edge and ordinal losses regularizes the solution without learned priors, solving for $\hat{\rho}$ and $\hat{\eta}$ by minimizing a reconstruction term plus physics-guided losses. The authors validate on synthetic and real data, introducing the VT-Intrinsic dataset of 600 visible–thermal image pairs, and demonstrate superior performance over both physics-based and learning-based baselines, highlighting the potential for scalable real-world supervision with minimal data.
Abstract
Decomposing an image into its underlying photometric factors--surface reflectance and shading--is a long-standing challenge due to the lack of extensive ground-truth data for real-world scenes. We introduce a novel physics-based approach for intrinsic image decomposition using a pair of visible and thermal images. We leverage the principle that light not reflected from an opaque surface is absorbed and detected as heat by a thermal camera. This allows us to relate the ordinalities (or relative magnitudes) between visible and thermal image intensities to the ordinalities of shading and reflectance, which enables a dense self-supervision of an optimizing neural network to recover shading and reflectance. We perform quantitative evaluations with known reflectance and shading under natural and artificial lighting, and qualitative experiments across diverse scenes. The results demonstrate superior performance over both physics-based and recent learning-based methods, providing a path toward scalable real-world data curation with supervision.
