Denoising Functional Maps: Diffusion Models for Shape Correspondence
Aleksei Zhuravlev, Zorah Lähner, Vladislav Golyanik
TL;DR
DenoisFM introduces a fundamentally new approach to shape correspondence by directly predicting functional maps with denoising diffusion models. It leverages template-based training, a sign-corrected Laplacian eigenbasis, and a diffusion-driven conditioning mechanism to produce accurate and generalizable maps across near-isometric, anisotropic, and cross-domain shapes, including animals. A key contribution is the unsupervised sign correction that canonicalizes eigenvectors, enabling stable learning of $C_{1T}$ and subsequent pairwise maps via composition and ZoomOut refinement. The results demonstrate competitive performance on standard datasets (FAUST, SCAPE, SHREC'19), strong cross-category generalization (zero-shot DT4D), and applicability to animal shapes, highlighting the potential of diffusion models in geometric matching and broader generalization capabilities.
Abstract
Estimating correspondences between pairs of deformable shapes remains a challenging problem. Despite substantial progress, existing methods lack broad generalization capabilities and require category-specific training data. To address these limitations, we propose a fundamentally new approach to shape correspondence based on denoising diffusion models. In our method, a diffusion model learns to directly predict the functional map, a low-dimensional representation of a point-wise map between shapes. We use a large dataset of synthetic human meshes for training and employ two steps to reduce the number of functional maps that need to be learned. First, the maps refer to a template rather than shape pairs. Second, the functional map is defined in a basis of eigenvectors of the Laplacian, which is not unique due to sign ambiguity. Therefore, we introduce an unsupervised approach to select a specific basis by correcting the signs of eigenvectors based on surface features. Our model achieves competitive performance on standard human datasets, meshes with anisotropic connectivity, non-isometric humanoid shapes, as well as animals compared to existing descriptor-based and large-scale shape deformation methods. See our project page for the source code and the datasets.
