DeepMesh: Auto-Regressive Artist-mesh Creation with Reinforcement Learning
Ruowen Zhao, Junliang Ye, Zhengyi Wang, Guangce Liu, Yiwen Chen, Yikai Wang, Jun Zhu
TL;DR
DeepMesh targets two obstacles in auto-regressive artist-mesh generation: excessively long token sequences and the lack of alignment with human preferences. It introduces an efficient mesh tokenization based on block-wise coordinate indexing, a pre-training pipeline that combines data curation with truncated training, and a DPO-based RL stage that aligns outputs with human preferences. Using 5k preference pairs and an Hourglass Transformer backbone, it generates high-fidelity, topologically rich meshes conditioned on point clouds or images, and outperforms state-of-the-art baselines in both geometry and aesthetics. The result is a scalable, human-aligned approach to producing highly detailed 3D meshes for creative and practical applications.
Abstract
Triangle meshes play a crucial role in 3D applications for efficient manipulation and rendering. While auto-regressive methods generate structured meshes by predicting discrete vertex tokens, they are often constrained by limited face counts and mesh incompleteness. To address these challenges, we propose DeepMesh, a framework that optimizes mesh generation through two key innovations: (1) an efficient pre-training strategy incorporating a novel tokenization algorithm, along with improvements in data curation and processing, and (2) the introduction of Reinforcement Learning (RL) into 3D mesh generation to achieve human preference alignment via Direct Preference Optimization (DPO). We design a scoring standard that combines human evaluation with 3D metrics to collect preference pairs for DPO, ensuring both visual appeal and geometric accuracy. Conditioned on point clouds and images, DeepMesh generates meshes with intricate details and precise topology, outperforming state-of-the-art methods in both precision and quality. Project page: https://zhaorw02.github.io/DeepMesh/
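To make the preference-alignment step concrete, the sketch below computes the standard DPO loss for a single preference pair. It is a minimal illustration of the general DPO objective, not DeepMesh's training code: the function name `dpo_loss`, the `beta` value, and the assumption that each input is the summed log-probability of a full mesh token sequence under the policy or the frozen reference model are all chosen here for illustration.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) pair.

    Each argument is assumed to be the total log-probability of a whole
    token sequence (hypothetically, a tokenized mesh) under either the
    trainable policy or the frozen reference model. beta=0.1 is an
    illustrative default, not a value taken from the paper.
    """
    # Log-ratio of policy to reference for each sequence.
    chosen_ratio = policy_chosen_logp - ref_chosen_logp
    rejected_ratio = policy_rejected_logp - ref_rejected_logp

    # Scaled preference margin: positive when the policy favors the
    # chosen sample more strongly than the reference does.
    margin = beta * (chosen_ratio - rejected_ratio)

    # -log(sigmoid(margin)), written in a numerically stable form.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


if __name__ == "__main__":
    # Policy identical to the reference: the margin is zero.
    print(dpo_loss(-10.0, -12.0, -10.0, -12.0))
    # Policy has shifted probability toward the chosen sample.
    print(dpo_loss(-9.0, -13.0, -10.0, -12.0))
```

When the policy matches the reference the loss equals log 2, and it falls as the policy raises the likelihood of the preferred mesh relative to the rejected one.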
