MeshMosaic: Scaling Artist Mesh Generation via Local-to-Global Assembly
Rui Xu, Tianyang Xue, Qiujie Dong, Le Wan, Zhe Zhu, Peng Li, Zhiyang Dou, Cheng Lin, Shiqing Xin, Yuan Liu, Wenping Wang, Taku Komura
TL;DR
MeshMosaic tackles the challenge of generating artist-quality meshes at high triangle counts by introducing a local-to-global autoregressive framework that assembles boundary-conditioned patches. Each patch is generated with boundary guidance and local-to-global features, using per-patch local quantization to $512^3$ to preserve detail while maintaining coherence across seams. The approach achieves meshes exceeding $10^5$ triangles with state-of-the-art geometric fidelity and strong user-preference performance on ShapeNet, Thingi10K, and Objaverse, illustrating its practical potential for game and film pipelines. Overall, MeshMosaic offers a scalable, patch-based paradigm for high-resolution 3D mesh synthesis that overcomes long-sequence bottlenecks and enables fine-grained detail recovery in artist-driven assets.
Abstract
Scaling artist-designed meshes to high triangle numbers remains challenging for autoregressive generative models. Existing transformer-based methods suffer from long-sequence bottlenecks and limited quantization resolution, primarily due to the large number of tokens required and constrained quantization granularity. These issues prevent faithful reproduction of fine geometric details and structured density patterns. We introduce MeshMosaic, a novel local-to-global framework for artist mesh generation that scales to over 100K triangles--substantially surpassing prior methods, which typically handle only around 8K faces. MeshMosaic first segments shapes into patches, generating each patch autoregressively and leveraging shared boundary conditions to promote coherence, symmetry, and seamless connectivity between neighboring regions. This strategy enhances scalability to high-resolution meshes by quantizing patches individually, resulting in more symmetrical and organized mesh density and structure. Extensive experiments across multiple public datasets demonstrate that MeshMosaic significantly outperforms state-of-the-art methods in both geometric fidelity and user preference, supporting superior detail representation and practical mesh generation for real-world applications.
