TetraDiffusion: Tetrahedral Diffusion Models for 3D Shape Generation
Nikolai Kalischek, Torben Peters, Jan D. Wegner, Konrad Schindler
TL;DR
TetraDiffusion introduces a first-of-its-kind 3D denoising diffusion model that operates directly on a deformable tetrahedral grid to generate high-resolution, textured meshes. By designing tetrahedral-specific convolution operators and a differentiable marching tetrahedra pipeline, the method achieves fast sampling and scalable memory use, and it supports color attributes for textured assets. The approach demonstrates superior geometric detail and competitive texture quality against strong 3D mesh and diffusion baselines, while enabling conditioning, interpolation, and test-time guidance. This has practical impact for rapid, scalable creation of detailed 3D assets on consumer hardware, with potential extensions to conditional generation and scene-level synthesis.
Abstract
Probabilistic denoising diffusion models (DDMs) have set a new standard for 2D image generation. Extending DDMs for 3D content creation is an active field of research. Here, we propose TetraDiffusion, a diffusion model that operates on a tetrahedral partitioning of 3D space to enable efficient, high-resolution 3D shape generation. Our model introduces operators for convolution and transpose convolution that act directly on the tetrahedral partition, and seamlessly includes additional attributes such as color. Remarkably, TetraDiffusion enables rapid sampling of detailed 3D objects in nearly real-time with unprecedented resolution. It's also adaptable for generating 3D shapes conditioned on 2D images. Compared to existing 3D mesh diffusion techniques, our method is up to 200 times faster in inference speed, works on standard consumer hardware, and delivers superior results.
