SwitchLight: Co-design of Physics-driven Architecture and Pre-training Framework for Human Portrait Relighting
Hoon Kim, Minje Jang, Wonjun Yoon, Jisoo Lee, Donghyun Na, Sanghyun Woo
TL;DR
SwitchLight tackles portrait relighting as an ill-posed problem by jointly designing a physics-driven architecture and a self-supervised pre-training strategy. It replaces the empirical Phong model in prior work with the Cook-Torrance BRDF and introduces Multi-Masked Autoencoder (MMAE) pre-training to scale training without heavy light-stage data. The framework decomposes inputs into normals, albedo, roughness, and lighting, and renders target-lit images via a two-stage inverse rendering and re-rendering pipeline, augmented by neural refinement. Empirical results on OLAT and FFHQ-based studies show improved realism, skin and hair detail, and consistent lighting, highlighting strong potential for VR/AR content creation and beyond-image applications.
Abstract
We introduce a co-designed approach for human portrait relighting that combines a physics-guided architecture with a pre-training framework. Drawing on the Cook-Torrance reflectance model, we have meticulously configured the architecture design to precisely simulate light-surface interactions. Furthermore, to overcome the limitation of scarce high-quality lightstage data, we have developed a self-supervised pre-training strategy. This novel combination of accurate physical modeling and expanded training dataset establishes a new benchmark in relighting realism.
