ZeST: Zero-Shot Material Transfer from a Single Image
Ta-Ying Cheng, Prafull Sharma, Andrew Markham, Niki Trigoni, Varun Jampani
TL;DR
ZeST addresses the problem of exemplar-based material transfer in 2D without training or explicit 3D geometry by leveraging a three-branch diffusion-guided pipeline: material encoding to obtain a latent $z_M$, depth-based geometry guidance via ControlNet to preserve input geometry, and latent illumination guidance through inpainting with a grayscale foreground initialization. The method injects $z_M$ into a pre-trained inpainting diffusion model through cross-attention, resulting in $I_{gen}$ that combines the exemplar’s material with the input’s geometry and lighting. Evaluations on real and synthetic datasets show improved material fidelity and photorealism over strong baselines while remaining entirely training-free, and the framework supports multiple object edits and lighting-aware variations. ZeST thus provides a scalable, practical tool for artists and graphics pipelines, with potential extensions to exemplar-based 3D texturing and relighting in untextured mesh renderings.
Abstract
We propose ZeST, a method for zero-shot material transfer to an object in the input image given a material exemplar image. ZeST leverages existing diffusion adapters to extract implicit material representation from the exemplar image. This representation is used to transfer the material using pre-trained inpainting diffusion model on the object in the input image using depth estimates as geometry cue and grayscale object shading as illumination cues. The method works on real images without any training resulting a zero-shot approach. Both qualitative and quantitative results on real and synthetic datasets demonstrate that ZeST outputs photorealistic images with transferred materials. We also show the application of ZeST to perform multiple edits and robust material assignment under different illuminations. Project Page: https://ttchengab.github.io/zest
