MaPa: Text-driven Photorealistic Material Painting for 3D Shapes
Shangzan Zhang, Sida Peng, Tao Xu, Yuanbo Yang, Tianrun Chen, Nan Xue, Yujun Shen, Hujun Bao, Ruizhen Hu, Xiaowei Zhou
TL;DR
MaPa addresses text-driven material painting for 3D shapes by using a segment-wise procedural material graph representation and a segment-controlled diffusion bridge to connect text prompts with high-resolution, relightable materials. It segments the mesh, generates segment-aligned 2D images via a segment-conditioned diffusion model, then initializes and optimizes segment-level material graphs through a differentiable rendering pipeline, with iterative recovery and downstream editing. The approach achieves photorealistic, tileable materials with strong editability and outperforms strong baselines in both quantitative metrics (FID/KID) and user studies, while enabling diverse results and image-prompt appearance transfer. This work advances practical 3D asset creation by combining diffusion-based image synthesis, CLIP-based material retrieval, and differentiable graphics to produce controllable, high-quality materials for complex geometries.
Abstract
This paper aims to generate materials for 3D meshes from text descriptions. Unlike existing methods that synthesize texture maps, we propose to generate segment-wise procedural material graphs as the appearance representation, which supports high-quality rendering and provides substantial flexibility in editing. Instead of relying on extensive paired data, i.e., 3D meshes with material graphs and corresponding text descriptions, to train a material graph generative model, we propose to leverage the pre-trained 2D diffusion model as a bridge to connect the text and material graphs. Specifically, our approach decomposes a shape into a set of segments and designs a segment-controlled diffusion model to synthesize 2D images that are aligned with mesh parts. Based on generated images, we initialize parameters of material graphs and fine-tune them through the differentiable rendering module to produce materials in accordance with the textual description. Extensive experiments demonstrate the superior performance of our framework in photorealism, resolution, and editability over existing methods. Project page: https://zju3dv.github.io/MaPa
