HandSCS: Structural Coordinate Space for Animatable Hand Gaussian Splatting
Yilan Dong, Wenqing Wang, Qing Wang, Jiahao Yang, Haohe Liu, Xiatuan Zhu, Gregory Slabaugh, Shanxin Yuan
TL;DR
HandSCS tackles animatable hand avatars from multi-view images by introducing a structure-guided Gaussian Splatting framework. It leverages a Structural Coordinate Space (SCS) to provide intra-pose structural cues through a hybrid static-dynamic bone basis and angular-radial descriptors, and an Inter-Pose Consistency Loss to enforce cross-pose coherence. Per-Gaussian embeddings and pose-aware offsets enable accurate geometry and appearance under strong deformations, with a lightweight non-rigid deformation pipeline. On InterHand2.6M, HandSCS achieves state-of-the-art performance for novel-pose animation and novel-view synthesis while maintaining real-time rendering speed.
Abstract
Creating animatable hand avatars from multi-view images requires modeling complex articulations and maintaining structural consistency across poses in real time. We present HandSCS, a structure-guided 3D Gaussian Splatting framework for high-fidelity hand animation. Unlike existing approaches that condition all Gaussians on the same global pose parameters, which are inadequate for highly articulated hands, HandSCS equips each Gaussian with explicit structural guidance from both intra-pose and inter-pose perspectives. To establish intra-pose structural guidance, we introduce a Structural Coordinate Space (SCS), which bridges the gap between sparse bones and dense Gaussians through hybrid static-dynamic coordinate basis and angular-radial descriptors. To improve cross-pose coherence, we further introduce an Inter-pose Consistency Loss that promotes consistent Gaussian attributes under similar articulations. Together, these components achieve high-fidelity results with consistent fine details, even in challenging high-deformation and self-contact regions. Experiments on the InterHand2.6M dataset demonstrate that HandSCS achieves state-of-the-art performance in hand avatar animation, confirming the effectiveness of explicit structural modeling.
