Single View Garment Reconstruction Using Diffusion Mapping Via Pattern Coordinates
Ren Li, Cong Cao, Corentin Dumery, Yingxuan You, Hao Li, Pascal Fua
TL;DR
This work tackles single-view reconstruction of high-fidelity 3D garments, with a focus on loose-fitting clothing. It introduces DISP, a garment representation that extends Implicit Sewing Patterns (ISP) with a diffusion prior, enabling realistic 3D deformations in a unified UV space and a diffusion-based mapping from image pixels to 3D and UV coordinates. A multi-stage pipeline combines image observations, a diffusion-driven back-normal model, and DISP priors to recover both rest-state and deformed garment geometries, followed by refinement and body adjustments. Empirical results on synthetic CLOTH3D data show improved geometric accuracy and detail over state-of-the-art methods, and the approach supports downstream tasks such as garment retargeting and texture editing, with demonstrated generalization to in-the-wild imagery despite training on synthetic data.
Abstract
Reconstructing 3D clothed humans from images is fundamental to applications like virtual try-on, avatar creation, and mixed reality. While recent advances have enhanced human body recovery, accurate reconstruction of garment geometry -- especially for loose-fitting clothing -- remains an open challenge. We present a novel method for high-fidelity 3D garment reconstruction from single images that bridges 2D and 3D representations. Our approach combines Implicit Sewing Patterns (ISP) with a generative diffusion model to learn rich garment shape priors in a 2D UV space. A key innovation is our mapping model that establishes correspondences between 2D image pixels, UV pattern coordinates, and 3D geometry, enabling joint optimization of both 3D garment meshes and the corresponding 2D patterns by aligning learned priors with image observations. Despite training exclusively on synthetically simulated cloth data, our method generalizes effectively to real-world images, outperforming existing approaches on both tight- and loose-fitting garments. The reconstructed garments maintain physical plausibility while capturing fine geometric details, enabling downstream applications including garment retargeting and texture manipulation.
