A Mesh Is Worth 512 Numbers: Spectral-domain Diffusion Modeling for High-dimension Shape Generation
Jiajie Fan, Amal Trigui, Andrea Bonfanti, Felix Dietrich, Thomas Bäck, Hao Wang
TL;DR
SpoDify introduces a learning-free spectral-domain diffusion framework that encodes meshes via SVD into compact spectral features and a DWT basis, enabling diffusion in a low-dimensional latent space. The method combines clustering-based representation, SDF, and wavelet transforms to produce a compact $d$-dimensional latent, here $d=512$, and reconstructs meshes via inverse transforms and marching cubes. Empirical results on ShapeNet chairs and airplanes show competitive quality with significantly reduced training time and memory requirements, highlighting strong efficiency and scalability for high-dimensional mesh generation. The approach offers a practical path toward diffusion-based 3D generation with limited data and resources, and points to future work incorporating full wavelet details and multi-category extensions.
Abstract
Recent advancements in learning latent codes derived from high-dimensional shapes have demonstrated impressive outcomes in 3D generative modeling. Traditionally, these approaches employ a trained autoencoder to acquire a continuous implicit representation of source shapes, which can be computationally expensive. This paper introduces a novel framework, spectral-domain diffusion for high-quality shape generation SpoDify, that utilizes singular value decomposition (SVD) for shape encoding. The resulting eigenvectors can be stored for subsequent decoding, while generative modeling is performed on the eigenfeatures. This approach efficiently encodes complex meshes into continuous implicit representations, such as encoding a 15k-vertex mesh to a 512-dimensional latent code without learning. Our method exhibits significant advantages in scenarios with limited samples or GPU resources. In mesh generation tasks, our approach produces high-quality shapes that are comparable to state-of-the-art methods.
