Table of Contents
Fetching ...

CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition

Hongwen Zhang, Siyou Lin, Ruizhi Shao, Yuxiang Zhang, Zerong Zheng, Han Huang, Yandong Guo, Yebin Liu

TL;DR

To tackle the seam artifact issues in recent state-of-the-art point-based methods, this paper proposes to learn point features on a body surface, which establishes a continuous and compact feature space to capture the fine-grained and pose-dependent clothing geometry.

Abstract

Creating animatable avatars from static scans requires the modeling of clothing deformations in different poses. Existing learning-based methods typically add pose-dependent deformations upon a minimally-clothed mesh template or a learned implicit template, which have limitations in capturing details or hinder end-to-end learning. In this paper, we revisit point-based solutions and propose to decompose explicit garment-related templates and then add pose-dependent wrinkles to them. In this way, the clothing deformations are disentangled such that the pose-dependent wrinkles can be better learned and applied to unseen poses. Additionally, to tackle the seam artifact issues in recent state-of-the-art point-based methods, we propose to learn point features on a body surface, which establishes a continuous and compact feature space to capture the fine-grained and pose-dependent clothing geometry. To facilitate the research in this field, we also introduce a high-quality scan dataset of humans in real-world clothing. Our approach is validated on two existing datasets and our newly introduced dataset, showing better clothing deformation results in unseen poses. The project page with code and dataset can be found at https://zhanghongwen.cn/closet.

CloSET: Modeling Clothed Humans on Continuous Surface with Explicit Template Decomposition

TL;DR

To tackle the seam artifact issues in recent state-of-the-art point-based methods, this paper proposes to learn point features on a body surface, which establishes a continuous and compact feature space to capture the fine-grained and pose-dependent clothing geometry.

Abstract

Creating animatable avatars from static scans requires the modeling of clothing deformations in different poses. Existing learning-based methods typically add pose-dependent deformations upon a minimally-clothed mesh template or a learned implicit template, which have limitations in capturing details or hinder end-to-end learning. In this paper, we revisit point-based solutions and propose to decompose explicit garment-related templates and then add pose-dependent wrinkles to them. In this way, the clothing deformations are disentangled such that the pose-dependent wrinkles can be better learned and applied to unseen poses. Additionally, to tackle the seam artifact issues in recent state-of-the-art point-based methods, we propose to learn point features on a body surface, which establishes a continuous and compact feature space to capture the fine-grained and pose-dependent clothing geometry. To facilitate the research in this field, we also introduce a high-quality scan dataset of humans in real-world clothing. Our approach is validated on two existing datasets and our newly introduced dataset, showing better clothing deformation results in unseen poses. The project page with code and dataset can be found at https://zhanghongwen.cn/closet.
Paper Structure (28 sections, 6 equations, 12 figures, 8 tables)

This paper contains 28 sections, 6 equations, 12 figures, 8 tables.

Figures (12)

  • Figure 1: Our method learns to decompose garment templates (top row) and add pose-dependent wrinkles upon them (bottom row).
  • Figure 2: Overview of the proposed method CloSET. Given an input body model, its pose code and garment code are processed hierarchically by point-based pose and garment encoders $\mathcal{F}_p$ and $\mathcal{F}_g$ for the learning of surface features $\bm{\phi}_p$ and $\bm{\phi}_g$. For any point $\bm{p}^t_i$ lying on the template surface, its features $\bm{\phi}(\bm{p}^t_i)$ are sampled from surface features accordingly and fed into two decoders for the prediction of the explicit garment template and pose-dependent wrinkle displacements, which will be combined and transformed to the clothing point cloud.
  • Figure 3: Comparison of the bilinear interpolation on the UV plane and the barycentric interpolation on the surface.
  • Figure 4: Comparison of different clothed human modeling methods on the proposed real-world scan dataset.
  • Figure 5: Comparison of the approach learned on UV planes (POP ma2021power) and the approach learned on continuous surfaces (Ours). Our solution alleviates the seam artifacts of POP.
  • ...and 7 more figures