How Will It Drape Like? Capturing Fabric Mechanics from Depth Images
Carlos Rodriguez-Pardo, Melania Prieto-Martin, Dan Casas, Elena Garces
TL;DR
The paper tackles scalable, perceptually meaningful estimation of fabric mechanics from casual depth captures. It presents a depth-image based pipeline that infers six bending and stretching parameters from two static views in hanging and stretch configurations, using a sim-to-real learning framework and a novel image-domain drape similarity metric aligned with human judgments. A synthetic-then-real evaluation regime, augmented data, and a multi-image fusion capability enable robust generalization to real fabrics captured with consumer hardware, advancing digital twin cloth representations. The perceptual drape metric and ablation studies demonstrate that parameter-space errors do not always reflect perceptual drape quality, underlining the importance of domain-aligned evaluation for cloth systems.
Abstract
We propose a method to estimate the mechanical parameters of fabrics using a casual capture setup with a depth camera. Our approach enables to create mechanically-correct digital representations of real-world textile materials, which is a fundamental step for many interactive design and engineering applications. As opposed to existing capture methods, which typically require expensive setups, video sequences, or manual intervention, our solution can capture at scale, is agnostic to the optical appearance of the textile, and facilitates fabric arrangement by non-expert operators. To this end, we propose a sim-to-real strategy to train a learning-based framework that can take as input one or multiple images and outputs a full set of mechanical parameters. Thanks to carefully designed data augmentation and transfer learning protocols, our solution generalizes to real images despite being trained only on synthetic data, hence successfully closing the sim-to-real loop.Key in our work is to demonstrate that evaluating the regression accuracy based on the similarity at parameter space leads to an inaccurate distances that do not match the human perception. To overcome this, we propose a novel metric for fabric drape similarity that operates on the image domain instead on the parameter space, allowing us to evaluate our estimation within the context of a similarity rank. We show that out metric correlates with human judgments about the perception of drape similarity, and that our model predictions produce perceptually accurate results compared to the ground truth parameters.
