PolyDiff: Generating 3D Polygonal Meshes with Diffusion Models
Antonio Alliegro, Yawar Siddiqui, Tatiana Tommasi, Matthias Nießner
TL;DR
PolyDiff presents a novel discrete diffusion framework that directly generates 3D polygonal meshes by modeling meshes as quantized triangle soups and denoising with a transformer-based network. By operating in the discrete vertex-coordinate space, it jointly learns vertex positions and face topology without post-processing, achieving superior FID and JSD over prior methods on ShapeNet categories. The paper demonstrates strong quantitative gains, ablation results favoring discrete over continuous diffusion, and qualitative evidence of novel, coherent meshes, while outlining limitations such as scene-level generation and sampling speed. Overall, PolyDiff offers a practical, diffusion-based path to high-fidelity 3D meshes directly in the mesh domain, with potential to reduce artist workload in downstream pipelines.
Abstract
We introduce PolyDiff, the first diffusion-based approach capable of directly generating realistic and diverse 3D polygonal meshes. In contrast to methods that use alternate 3D shape representations (e.g. implicit representations), our approach is a discrete denoising diffusion probabilistic model that operates natively on the polygonal mesh data structure. This enables learning of both the geometric properties of vertices and the topological characteristics of faces. Specifically, we treat meshes as quantized triangle soups, progressively corrupted with categorical noise in the forward diffusion phase. In the reverse diffusion phase, a transformer-based denoising network is trained to revert the noising process, restoring the original mesh structure. At inference, new meshes can be generated by applying this denoising network iteratively, starting with a completely noisy triangle soup. Consequently, our model is capable of producing high-quality 3D polygonal meshes, ready for integration into downstream 3D workflows. Our extensive experimental analysis shows that PolyDiff achieves a significant advantage (avg. FID and JSD improvement of 18.2 and 5.8 respectively) over current state-of-the-art methods.
