Relightable Holoported Characters: Capturing and Relighting Dynamic Human Performance from Sparse Views
Kunwar Maheep Singh, Jianchun Chen, Vladislav Golyanik, Stephan J. Garbin, Thabo Beeler, Rishabh Dabral, Marc Habermann, Christian Theobalt
TL;DR
This work tackles relighting dynamic human performances from sparse RGB views by introducing Relightable Holoported Characters (RHC), an end-to-end pipeline that animates a subject-specific mesh, computes physics-informed features in UV space, and uses a transformer-based RelightNet to predict texel-aligned 3D Gaussians for relit appearance in unseen lighting and viewpoints. A novel multi-view lightstage capture strategy interleaves environment-map lighting with tracking frames, enabling diverse illumination and accurate geometry guidance without OLAT. The method demonstrates superior visual fidelity and lighting reproduction over state-of-the-art baselines across multiple subjects and unseen motions, with ablations confirming the importance of geometry, albedo, shading, and cross-attention mechanisms. While limited by identity specificity and runtime, RHC opens pathways for scalable, photorealistic relighting of dynamic human avatars in virtual environments and telepresence.
Abstract
We present Relightable Holoported Characters (RHC), a novel person-specific method for free-view rendering and relighting of full-body and highly dynamic humans solely observed from sparse-view RGB videos at inference. In contrast to classical one-light-at-a-time (OLAT)-based human relighting, our transformer-based RelightNet predicts relit appearance within a single network pass, avoiding costly OLAT-basis capture and generation. For training such a model, we introduce a new capture strategy and dataset recorded in a multi-view lightstage, where we alternate frames lit by random environment maps with uniformly lit tracking frames, simultaneously enabling accurate motion tracking and diverse illumination as well as dynamics coverage. Inspired by the rendering equation, we derive physics-informed features that encode geometry, albedo, shading, and the virtual camera view from a coarse human mesh proxy and the input views. Our RelightNet then takes these features as input and cross-attends them with a novel lighting condition, and regresses the relit appearance in the form of texel-aligned 3D Gaussian splats attached to the coarse mesh proxy. Consequently, our RelightNet implicitly learns to efficiently compute the rendering equation for novel lighting conditions within a single feed-forward pass. Experiments demonstrate our method's superior visual fidelity and lighting reproduction compared to state-of-the-art approaches. Project page: https://vcai.mpi-inf.mpg.de/projects/RHC/
