Generalizable and Relightable Gaussian Splatting for Human Novel View Synthesis
Yipengjing Sun, Shengping Zhang, Chenyang Wang, Shunyuan Zheng, Zonglin Li, Xiangyang Ji
TL;DR
This work introduces GRGS, a generalizable and relightable 3D Gaussian framework for human novel view synthesis that operates without per-character optimization. It combines Lighting-robust Geometry Refinement (LGR) to recover robust depth and high-quality normals with a Physically Grounded Neural Rendering (PGNR) module that fuses physics-based shading and neural prediction, aided by a 2D-to-3D projection training strategy to minimize explicit ray tracing. The method parameterizes and predicts geometry-aware Gaussian attributes and SH-based light transport, enabling editable relighting with shadows and indirect illumination. Experiments on multi-view human datasets and real-world data demonstrate improved geometric fidelity, relighting realism, and strong generalization across identities and illumination conditions, with efficient inference around 20 FPS on a high-end GPU.
Abstract
We propose GRGS, a generalizable and relightable 3D Gaussian framework for high-fidelity human novel view synthesis under diverse lighting conditions. Unlike existing methods that rely on per-character optimization or ignore physical constraints, GRGS adopts a feed-forward, fully supervised strategy projecting geometry, material, and illumination cues from multi-view 2D observations into 3D Gaussian representations. To recover accurate geometry under diverse lighting conditions, we introduce a Lighting-robust Geometry Refinement (LGR) module trained on synthetically relit data to predict precise depth and surface normals. Based on the high-quality geometry, a Physically Grounded Neural Rendering (PGNR) module is further proposed to integrate neural prediction with physics-based shading, supporting editable relighting with shadows and indirect illumination. Moreover, we design a 2D-to-3D projection training scheme leveraging differentiable supervision from ambient occlusion, direct, and indirect lighting maps, alleviating the computational cost of ray tracing. Extensive experiments demonstrate that GRGS achieves superior visual quality, geometric consistency, and generalization across characters and lighting conditions.
