PacTure: Efficient PBR Texture Generation on Packed Views with Visual Autoregressive Models
Fan Fei, Jiajun Tang, Fei-Peng Tian, Boxin Shi, Ping Tan
TL;DR
PacTure tackles efficient, high-fidelity PBR texture generation for untextured meshes by combining view packing with a visual autoregressive backbone. It packs six view maps into a single atlas to boost per-view resolution without increasing inference cost, and employs a two-stage, single-view-to-multi-view generation strategy with domain embeddings to achieve multi-view, multi-domain outputs that back-project reliably to UV space. The approach yields state-of-the-art texture quality and faster inference compared to baselines, demonstrating strong practical potential for scalable 3D asset creation. Limitations include occluded regions not captured by the canonical views, which are filled via UV-space extrapolation and may introduce minor inconsistencies. Overall, PacTure offers a significant advance in efficient, controllable PBR texturing with broad applicability in games, movies, and virtual/augmented reality workflows.
Abstract
We present PacTure, a novel framework for generating physically-based rendering (PBR) material textures from an untextured 3D mesh, a text description, and an optional image prompt. Early 2D generation-based texturing approaches generate textures sequentially from different views, resulting in long inference times and globally inconsistent textures. More recent approaches adopt multi-view generation with cross-view attention to enhance global consistency, which, however, limits the resolution for each view. In response to these weaknesses, we first introduce view packing, a novel technique that significantly increases the effective resolution for each view during multi-view generation without imposing additional inference cost, by formulating the arrangement of multi-view maps as a 2D rectangle bin packing problem. In contrast to UV mapping, it preserves the spatial proximity essential for image generation and maintains full compatibility with current 2D generative models. To further reduce the inference cost, we enable fine-grained control and multi-domain generation within the next-scale prediction autoregressive framework to create an efficient multi-view multi-domain generative backbone. Extensive experiments show that PacTure outperforms state-of-the-art methods in both quality of generated PBR textures and efficiency in training and inference.
