Image2StyleGAN++: How to Edit the Embedded Images?
Rameen Abdal, Yipeng Qin, Peter Wonka
TL;DR
Image2StyleGAN++ advances real-image embedding into StyleGAN by adding a Noise space optimization step to recover high-frequency details, enabling PSNR gains up to ~45 dB. It extends the W+ latent embedding with local masks and layer-specific constraints, allowing partial or approximate embeddings and editable regions. By combining embedding with activation-tensor manipulations (spatial, channel-wise, and averaging operations), the framework supports high-quality local edits alongside global semantic transformations, enabling applications like image reconstruction, inpainting, crossover, local scribble edits, local style transfer, and attribute-level feature transfer. The method demonstrates superior reconstruction quality, flexible local control, and broad applicability, with potential extensions to video editing in future work.
Abstract
We propose Image2StyleGAN++, a flexible image editing framework with many applications. Our framework extends the recent Image2StyleGAN in three ways. First, we introduce noise optimization as a complement to the $W^+$ latent space embedding. Our noise optimization can restore high-frequency features in images and thus significantly improves the quality of reconstructed images, e.g. a big increase of PSNR from 20 dB to 45 dB. Second, we extend the global $W^+$ latent space embedding to enable local embeddings. Third, we combine embedding with activation tensor manipulation to perform high-quality local edits along with global semantic edits on images. Such edits motivate various high-quality image editing applications, e.g. image reconstruction, image inpainting, image crossover, local style transfer, image editing using scribbles, and attribute level feature transfer. Examples of the edited images are shown across the paper for visual inspection.
