MeshRipple: Structured Autoregressive Generation of Artist-Meshes
Junkai Lin, Hang Long, Huipeng Guo, Jielei Zhang, JiaYi Yang, Tianle Guo, Yang Yang, Jianwen Li, Wenxiao Zhang, Matthias Nießner, Wei Yang
TL;DR
MeshRipple tackles topology-context mismatch in autoregressive mesh generation by introducing Ripple Tokenization, which uses a frontier-aware BFS to keep the next-face context near the sequence tail. The method couples a structured transformer with Frontier Attention and Native Sparse Contextual Attention to access long-range cues while maintaining tractable memory, and employs an expansive prediction strategy that jointly selects the next face and the next frontier root. This combination yields improved topological fidelity and surface completeness, outperforming recent baselines on artist meshes and remaining competitive on dense meshes with lower compute. The approach enables scalable, artist-friendly mesh synthesis and paves the way for further improvements in topology control and multi-component scene generation.
Abstract
Meshes serve as a primary representation for 3D assets. Autoregressive mesh generators serialize faces into sequences and train on truncated segments with sliding-window inference to cope with memory limits. However, this mismatch breaks long-range geometric dependencies, producing holes and fragmented components. To address this critical limitation, we introduce MeshRipple, which expands a mesh outward from an active generation frontier, akin to a ripple on a surface. MeshRipple rests on three key innovations: a frontier-aware BFS tokenization that aligns the generation order with surface topology; an expansive prediction strategy that maintains coherent, connected surface growth; and a sparse-attention global memory that provides an effectively unbounded receptive field to resolve long-range topological dependencies. This integrated design enables MeshRipple to generate meshes with high surface fidelity and topological completeness, outperforming strong recent baselines.
