MatMart: Material Reconstruction of 3D Objects via Diffusion
Xiuchao Wu, Pengfei Zhu, Jiangjing Lyu, Xinguo Liu, Jie Guo, Yanwen Guo, Weiwei Xu, Chengfei Lyu
TL;DR
MatMart tackles the challenge of reconstructing physically-based rendering materials for 3D objects from RGB images by leveraging diffusion models in a two-stage pipeline. It first performs progressive material estimation with UV-space baking, then uses adaptive view selection and prior-guided generation to fill occluded regions, all within a single end-to-end trainable diffusion model. A core contribution is the view-material cross-attention (VMCA), which enables multi-view consistency under progressive inference and reduces memory demands. Comprehensive experiments on Objaverse show superior material prediction and generation quality, with ablations confirming the effectiveness of VMCA, material priors, and texture baking. The approach offers a scalable, stable alternative to multi-model pipelines, enabling high-resolution, view-flexible material reconstruction for practical applications.
Abstract
Applying diffusion models to physically-based material estimation and generation has recently gained prominence. In this paper, we propose \ttt, a novel material reconstruction framework for 3D objects, offering the following advantages. First, \ttt\ adopts a two-stage reconstruction, starting with accurate material prediction from inputs and followed by prior-guided material generation for unobserved views, yielding high-fidelity results. Second, by utilizing progressive inference alongside the proposed view-material cross-attention (VMCA), \ttt\ enables reconstruction from an arbitrary number of input images, demonstrating strong scalability and flexibility. Finally, \ttt\ achieves both material prediction and generation capabilities through end-to-end optimization of a single diffusion model, without relying on additional pre-trained models, thereby exhibiting enhanced stability across various types of objects. Extensive experiments demonstrate that \ttt\ achieves superior performance in material reconstruction compared to existing methods.
