Neural Wavelet-domain Diffusion for 3D Shape Generation
Ka-Hei Hui, Ruihui Li, Jingyu Hu, Chi-Wing Fu
TL;DR
<3-5 sentence high-level summary> The paper tackles the challenge of directly generating high-fidelity 3D shapes by operating on a continuous implicit surface representation in the wavelet frequency domain. It introduces a compact representation built from a truncated TSDF decomposed into coarse and detail coefficient volumes, and trains a diffusion-based generator for the coarse coefficients plus a detail predictor for fine details, enabling rich shapes with complex topology. Through ShapeNet-based experiments, the approach achieves superior qualitative and quantitative performance compared with state-of-the-art methods, highlighting improvements in surface cleanliness, detail, and diversity. This wavelet-domain diffusion framework opens avenues for efficient implicit-surface generation and potential extensions to conditioning, editing, and animation.
Abstract
This paper presents a new approach for 3D shape generation, enabling direct generative modeling on a continuous implicit representation in wavelet domain. Specifically, we propose a compact wavelet representation with a pair of coarse and detail coefficient volumes to implicitly represent 3D shapes via truncated signed distance functions and multi-scale biorthogonal wavelets, and formulate a pair of neural networks: a generator based on the diffusion model to produce diverse shapes in the form of coarse coefficient volumes; and a detail predictor to further produce compatible detail coefficient volumes for enriching the generated shapes with fine structures and details. Both quantitative and qualitative experimental results manifest the superiority of our approach in generating diverse and high-quality shapes with complex topology and structures, clean surfaces, and fine details, exceeding the 3D generation capabilities of the state-of-the-art models.
