An Embeddable Implicit IUVD Representation for Part-based 3D Human Surface Reconstruction
Baoxing Li, Yong Deng, Yehui Yang, Xu Zhao
TL;DR
The paper addresses reconstructing 3D clothed human surfaces from a single image while preserving SMPL pose/shape priors. It introduces the IUVD-Feedback representation, embedding SMPL UV maps into an IUVD space and replacing costly SDF-based queries with a linear transformation, augmented by a feedback-driven query strategy. Experiments on THuman2.0 demonstrate threefold acceleration in the query-and-infer stage and improved robustness, without retraining networks, with added potential for part-based editing and generative applications due to the semantic IUVD structure. The approach bridges SMPL-based priors and implicit surface reconstruction, delivering faster, more stable, and semantically rich clothed human reconstructions suitable for integration into existing pipelines and downstream tasks.
Abstract
To reconstruct a 3D human surface from a single image, it is crucial to simultaneously consider human pose, shape, and clothing details. Recent approaches have combined parametric body models (such as SMPL), which capture body pose and shape priors, with neural implicit functions that flexibly learn clothing details. However, this combined representation introduces additional computation, e.g. signed distance calculation in 3D body feature extraction, leading to redundancy in the implicit query-and-infer process and failing to preserve the underlying body shape prior. To address these issues, we propose a novel IUVD-Feedback representation, consisting of an IUVD occupancy function and a feedback query algorithm. This representation replaces the time-consuming signed distance calculation with a simple linear transformation in the IUVD space, leveraging the SMPL UV maps. Additionally, it reduces redundant query points through a feedback mechanism, leading to more reasonable 3D body features and more effective query points, thereby preserving the parametric body prior. Moreover, the IUVD-Feedback representation can be embedded into any existing implicit human reconstruction pipeline without requiring modifications to the trained neural networks. Experiments on the THuman2.0 dataset demonstrate that the proposed IUVD-Feedback representation improves the robustness of results and achieves three times faster acceleration in the query-and-infer process. Furthermore, this representation holds potential for generative applications by leveraging its inherent semantic information from the parametric body model.
