LaFiTe: A Generative Latent Field for 3D Native Texturing

Chia-Hao Chen; Zi-Xin Zou; Yan-Pei Cao; Ze Yuan; Guan Luo; Xiaojuan Qi; Ding Liang; Song-Hai Zhang; Yuan-Chen Guo

TL;DR

3D-native texturing remains limited by topology- and UV-dependent representations. LaFiTe introduces a sparse latent color field learned with a VAE from densely sampled colored point clouds, plus a geometry latent derived from monochrome input to condition generation. A conditional rectified-flow model then synthesizes high-quality textures that are coherent across complex geometries and can be baked into UV maps for rendering, while enabling PBR materials and local refinement. Results show >10 dB PSNR gains over prior native approaches and strong performance against multi-view projection baselines, highlighting the practical potential for scalable 3D content creation workflows.

Abstract

Generating high-fidelity, seamless textures directly on 3D surfaces, what we term 3D-native texturing, remains a fundamental open challenge, with the potential to overcome long-standing limitations of UV-based and multi-view projection methods. However, existing native approaches are constrained by the absence of a powerful and versatile latent representation, which severely limits the fidelity and generality of their generated textures. We identify this representation gap as the principal barrier to further progress. We introduce LaFiTe, a framework that addresses this challenge by learning to generate textures as a 3D generative sparse latent color field. At its core, LaFiTe employs a variational autoencoder (VAE) to encode complex surface appearance into a sparse, structured latent space, which is subsequently decoded into a continuous color field. This representation achieves unprecedented fidelity, exceeding state-of-the-art methods by >10 dB PSNR in reconstruction, by effectively disentangling texture appearance from mesh topology and UV parameterization. Building upon this strong representation, a conditional rectified-flow model synthesizes high-quality, coherent textures across diverse styles and geometries. Extensive experiments demonstrate that LaFiTe not only sets a new benchmark for 3D-native texturing but also enables flexible downstream applications such as material synthesis and texture super-resolution, paving the way for the next generation of 3D content creation workflows.
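
To give a sense of scale for the ">10 dB PSNR" figure, the sketch below computes PSNR from mean squared error and shows that a 10 dB gain corresponds to a tenfold reduction in MSE. It is an illustration of the standard metric only, not the paper's evaluation code: the function name `psnr`, the peak value of 1.0, and the sample color values are all assumptions made for this example.

```python
import math


def psnr(reference, estimate, peak=1.0):
    """Return PSNR in dB between two equal-length sequences of color values.

    Assumes values lie in [0, peak]; peak=1.0 is an illustrative choice.
    """
    if len(reference) != len(estimate) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all samples.
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals: PSNR is unbounded
    return 10.0 * math.log10(peak ** 2 / mse)


# Hypothetical sampled surface colors (one channel, for brevity).
reference = [0.2, 0.5, 0.8, 0.4]

# A reconstruction with per-sample error 0.1 (MSE of about 0.01).
coarse = [r + 0.1 for r in reference]

# A reconstruction whose error is smaller by sqrt(10), so MSE is 10x lower.
fine = [r + 0.1 / math.sqrt(10) for r in reference]

gain_db = psnr(reference, fine) - psnr(reference, coarse)
print(f"PSNR gain: {gain_db:.2f} dB")
```

Because PSNR is logarithmic in MSE, every additional 10 dB divides the mean squared error by another factor of ten.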
