Editing in Style: Uncovering the Local Semantics of GANs

Edo Collins; Raja Bala; Bob Price; Sabine Süsstrunk

Editing in Style: Uncovering the Local Semantics of GANs

Edo Collins, Raja Bala, Bob Price, Sabine Süsstrunk

TL;DR

This work reveals that StyleGAN learns spatially disentangled semantic objects and parts in its latent space, enabling local, semantically aware edits without external supervision. It introduces a ROI-guided style-transfer mechanism that transfers appearance from a reference image by conditioning style interpolation with a diagonal query matrix, leveraging a semantic cluster catalog produced via spherical k-means. Quantitative and qualitative evaluations on FFHQ, LSUN-Bedrooms, and StyleGAN2 demonstrate localized edits that preserve photorealism, outperforming naive blending methods in locality. The approach offers a practical route to versatile image editing with potential extensions to real-image editing through latent-space embedding.

Abstract

While the quality of GAN image synthesis has improved tremendously in recent years, our ability to control and condition the output is still limited. Focusing on StyleGAN, we introduce a simple and effective method for making local, semantically-aware edits to a target output image. This is accomplished by borrowing elements from a source image, also a GAN output, via a novel manipulation of style vectors. Our method requires neither supervision from an external model, nor involves complex spatial morphing operations. Instead, it relies on the emergent disentanglement of semantic objects that is learned by StyleGAN during its training. Semantic editing is demonstrated on GANs producing human faces, indoor scenes, cats, and cars. We measure the locality and photorealism of the edits produced by our method, and find that it accomplishes both.

Editing in Style: Uncovering the Local Semantics of GANs

TL;DR

Abstract

Editing in Style: Uncovering the Local Semantics of GANs

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (19)