Uncertainty-Aware 4D Gaussian Splatting for Monocular Occluded Human Rendering
Weiquan Wang, Feifei Shao, Lin Li, Zhen Wang, Jun Xiao, Long Chen
TL;DR
This work reframes monocular occluded human rendering as MAP estimation under heteroscedastic observation noise and introduces U-4DGS, a framework combining a Probabilistic Deformation Network with a Double Rasterization pipeline to produce pixel-aligned uncertainty maps that guide optimization. The uncertainty acts as an adaptive gradient modulator, selectively down-weighting unreliable observations and enabling confidence-aware regularizations to prevent geometric drift in occluded regions. Extensive experiments on ZJU-MoCap and OcMotion demonstrate state-of-the-art rendering fidelity and robustness, outperforming both discriminative and generative baselines under severe occlusion. The approach provides a principled, efficient alternative to hallucination-based methods, with practical implications for real-time, occlusion-resilient dynamic human rendering.
Abstract
High-fidelity rendering of dynamic humans from monocular videos typically degrades catastrophically under occlusions. Existing solutions incorporate external priors-either hallucinating missing content via generative models, which induces severe temporal flickering, or imposing rigid geometric heuristics that fail to capture diverse appearances. To this end, we reformulate the task as a Maximum A Posteriori estimation problem under heteroscedastic observation noise. In this paper, we propose U-4DGS, a framework integrating a Probabilistic Deformation Network and a Double Rasterization pipeline. This architecture renders pixel-aligned uncertainty maps that act as an adaptive gradient modulator, automatically attenuating artifacts from unreliable observations. Furthermore, to prevent geometric drift in regions lacking reliable visual cues, we enforce Confidence-Aware Regularizations, which leverage the learned uncertainty to selectively propagate spatial-temporal validity. Extensive experiments on ZJU-MoCap and OcMotion demonstrate that U-4DGS achieves SOTA rendering fidelity and robustness.
