Thin-Shell-SfT: Fine-Grained Monocular Non-rigid 3D Surface Tracking with Neural Deformation Fields

Navami Kairanda; Marc Habermann; Shanthika Naik; Christian Theobalt; Vladislav Golyanik

TL;DR

This work tackles monocular non-rigid 3D surface tracking of highly deformable objects (e.g., cloth). It introduces Thin-Shell-SfT, which replaces discrete meshes with a continuous, adaptive neural surface and couples a Kirchhoff-Love thin-shell prior with differentiable 3D Gaussian splatting to enforce photometric consistency via analysis-by-synthesis. The method learns a dual-network deformation representation: a Neural Reference Field $\bar{x}(\xi;\Upsilon)$ for the template and a Neural Deformation Field $u(\xi,t;\Theta)$ for time-varying motion, which enables the capture of high-frequency wrinkles. On the $\boldsymbol{\phi}$-SfT dataset, the method outperforms prior SfT/NRSfM and physics-based approaches in geometric accuracy and temporal coherence, while remaining computationally feasible for monocular video sequences.
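To make the dual-field representation concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes that the deformed surface point is the sum of the reference position and the displacement, $x(\xi,t) = \bar{x}(\xi;\Upsilon) + u(\xi,t;\Theta)$, that $\xi$ is a 2D surface parameter, and that both fields are small tanh MLPs with random, untrained weights. The helper names (`init_mlp`, `reference_field`, `deformation_field`, `deformed_surface`) and the layer sizes are hypothetical and chosen only for illustration.

```python
import numpy as np

def init_mlp(rng, sizes):
    """Random weights for a small MLP (illustrative, untrained)."""
    return [(rng.normal(0.0, 0.5, (m, n)), np.zeros(n))
            for m, n in zip(sizes[:-1], sizes[1:])]

def mlp(params, x):
    """tanh hidden layers followed by a linear output layer."""
    for W, b in params[:-1]:
        x = np.tanh(x @ W + b)
    W, b = params[-1]
    return x @ W + b

def reference_field(upsilon, xi):
    """x_bar(xi; Upsilon): maps 2D surface coordinates to 3D template points."""
    return mlp(upsilon, xi)

def deformation_field(theta, xi, t):
    """u(xi, t; Theta): maps (surface coordinates, time) to a 3D displacement."""
    t_col = np.full((xi.shape[0], 1), t)
    return mlp(theta, np.concatenate([xi, t_col], axis=1))

def deformed_surface(upsilon, theta, xi, t):
    """Assumed composition: template position plus time-varying displacement."""
    return reference_field(upsilon, xi) + deformation_field(theta, xi, t)

rng = np.random.default_rng(0)
upsilon = init_mlp(rng, [2, 32, 3])   # input: xi (2D), output: 3D point
theta = init_mlp(rng, [3, 32, 3])     # input: xi and t (3D), output: 3D offset
xi = rng.uniform(0.0, 1.0, (5, 2))    # five sample points in parameter space
points = deformed_surface(upsilon, theta, xi, t=0.5)
```

Because both fields are functions of continuous $\xi$, the surface can be sampled at any resolution, which is the property that distinguishes this representation from a fixed-topology mesh.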

Abstract

3D reconstruction of highly deformable surfaces (e.g. cloths) from monocular RGB videos is a challenging problem, and no solution provides a consistent and accurate recovery of fine-grained surface details. To account for the ill-posed nature of the setting, existing methods use deformation models with statistical, neural, or physical priors. They also predominantly rely on non-adaptive discrete surface representations (e.g. polygonal meshes), perform frame-by-frame optimisation leading to error propagation, and suffer from poor gradients of the mesh-based differentiable renderers. Consequently, fine surface details such as cloth wrinkles are often not recovered with the desired accuracy. In response to these limitations, we propose Thin-Shell-SfT, a new method for non-rigid 3D tracking that represents a surface as an implicit and continuous spatiotemporal neural field. We incorporate a continuous thin-shell physics prior based on the Kirchhoff-Love model for spatial regularisation, which starkly contrasts with the discretised alternatives of earlier works. Lastly, we leverage 3D Gaussian splatting to differentiably render the surface into image space and optimise the deformations based on analysis-by-synthesis principles. Our Thin-Shell-SfT outperforms prior works qualitatively and quantitatively thanks to our continuous surface formulation in conjunction with a specially tailored simulation prior and surface-induced 3D Gaussians. See our project page at https://4dqv.mpiinf.mpg.de/ThinShellSfT.
