TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing
Stefan Lionar, Jiabin Liang, Gim Hee Lee
TL;DR
TreeMeshGPT tackles the challenge of synthesizing artist-quality 3D meshes conditioned on point clouds by introducing Autoregressive Tree Sequencing, a DFS-based, dynamically growing tree traversal that retrieves the next token from triangle adjacency rather than predicting the next token in a flat sequence. Representing each triangular face with two tokens, and using a $7$-bit discretization, the model achieves a compression of about $22\%$ and scales to roughly $5{,}500$ faces under a strong $2{,}048$ point-token conditioning. Empirically, it outperforms previous autoregressive approaches in both fidelity and normal orientation, with lower CD and higher NC/|NC| on Objaverse and GSO datasets. The approach enables higher-detail artistic meshes suitable for real-time applications, though it acknowledges limitations around topology optimization and longer-sequence failure modes.
Abstract
We introduce TreeMeshGPT, an autoregressive Transformer designed to generate high-quality artistic meshes aligned with input point clouds. Instead of the conventional next-token prediction in autoregressive Transformer, we propose a novel Autoregressive Tree Sequencing where the next input token is retrieved from a dynamically growing tree structure that is built upon the triangle adjacency of faces within the mesh. Our sequencing enables the mesh to extend locally from the last generated triangular face at each step, and therefore reduces training difficulty and improves mesh quality. Our approach represents each triangular face with two tokens, achieving a compression rate of approximately 22% compared to the naive face tokenization. This efficient tokenization enables our model to generate highly detailed artistic meshes with strong point cloud conditioning, surpassing previous methods in both capacity and fidelity. Furthermore, our method generates mesh with strong normal orientation constraints, minimizing flipped normals commonly encountered in previous methods. Our experiments show that TreeMeshGPT enhances the mesh generation quality with refined details and normal orientation consistency.
