ClothHMR: 3D Mesh Recovery of Humans in Diverse Clothing from Single Image

Yunqi Gao; Leyuan Liu; Yuhan Li; Changxin Gao; Yuanyuan Liu; Jingying Chen

TL;DR

ClothHMR tackles 3D human mesh recovery under diverse clothing with a two-module approach: clothing tailoring (CT), which fits garments to the body silhouette using body semantics and edge cues, and FHVM-based mesh recovering (MR), which uses a foundational human visual model (FHVM) to produce high-fidelity intermediate representations (joints, depth, silhouette) and iteratively refine SMPL parameters. The method demonstrates significant improvements over state-of-the-art methods on Cloth4D, THuman2.0, EMDB, and 3DPW, and its efficacy is validated through ablations and a web-based virtual try-on application. The work highlights the value of combining clothing-aware preprocessing with a strong, unified foundational model to improve robustness to loose clothing and complex poses in 3D human reconstruction.

Abstract

With 3D data rapidly emerging as an important form of multimedia information, 3D human mesh recovery technology has also advanced accordingly. However, current methods mainly focus on handling humans wearing tight clothing and perform poorly when estimating body shapes and poses under diverse clothing, especially loose garments. To this end, we offer two key insights: (1) tailoring clothing to fit the human body can mitigate the adverse impact of clothing on 3D human mesh recovery, and (2) utilizing human visual information from large foundational models can enhance the generalization ability of the estimation. Based on these insights, we propose ClothHMR to accurately recover 3D meshes of humans in diverse clothing. ClothHMR primarily consists of two modules: clothing tailoring (CT) and FHVM-based mesh recovering (MR). The CT module employs body semantic estimation and body edge prediction to tailor the clothing, ensuring it fits the body silhouette. The MR module optimizes the initial parameters of the 3D human mesh by continuously aligning the intermediate representations of the 3D mesh with those inferred from the foundational human visual model (FHVM). ClothHMR can accurately recover 3D meshes of humans wearing diverse clothing, precisely estimating their body shapes and poses. Experimental results demonstrate that ClothHMR significantly outperforms existing state-of-the-art methods across benchmark datasets and in-the-wild images. Additionally, a web application for online fashion and shopping powered by ClothHMR is developed, illustrating that ClothHMR can effectively serve real-world usage scenarios. The code and model for ClothHMR are available at: https://github.com/starVisionTeam/ClothHMR.
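
The alignment idea behind the MR module can be illustrated with a toy sketch. This is not the authors' implementation: SMPL, the FHVM, and the depth and silhouette terms are all omitted, and the names `project` and `refine`, the weak-perspective camera, the hand-picked joints, and the synthetic 2D targets are assumptions introduced only for illustration. The sketch keeps a small set of 3D joints fixed and refines three parameters (scale and 2D translation) by gradient descent so that the projected joints match target joints of the kind a vision model might predict.

```python
import numpy as np

def project(joints3d, params):
    """Weak-perspective projection: scale the x, y coordinates, then translate."""
    s, tx, ty = params
    return s * joints3d[:, :2] + np.array([tx, ty])

def refine(joints3d, target2d, params, lr=0.1, steps=500):
    """Gradient descent on the mean squared 2D joint error."""
    params = np.asarray(params, dtype=float).copy()
    xy = joints3d[:, :2]
    n = len(joints3d)
    for _ in range(steps):
        residual = project(joints3d, params) - target2d  # shape (N, 2)
        grad_s = 2.0 * np.sum(residual * xy) / n         # d(loss)/d(scale)
        grad_t = 2.0 * residual.sum(axis=0) / n          # d(loss)/d(tx, ty)
        params -= lr * np.array([grad_s, grad_t[0], grad_t[1]])
    return params

# Hypothetical 3D joints (head, two shoulders, two feet), arbitrary units.
joints3d = np.array([
    [0.0, 0.8, 0.0],
    [-0.3, 0.4, 0.1],
    [0.3, 0.4, 0.1],
    [-0.2, -0.6, 0.0],
    [0.2, -0.6, 0.0],
])

# Synthetic targets standing in for joints inferred by a vision model.
true_params = np.array([1.5, 0.2, -0.1])
target2d = project(joints3d, true_params)

# Start from a rough initial estimate and refine it by alignment.
estimated = refine(joints3d, target2d, params=[1.0, 0.0, 0.0])
```

In the method described above, the refined quantities are the parameters of the 3D human mesh and the alignment covers several intermediate representations; the sketch shows only the shape of the loop, in which a differentiable projection is compared against targets and the parameters are updated to reduce the mismatch.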
