Disentangled Inference for GANs with Latently Invertible Autoencoder

Jiapeng Zhu; Deli Zhao; Bo Zhang; Bolei Zhou

TL;DR

The paper addresses real-image inference for GANs, arguing that latent-space entanglement is what hampers encoder learning. It introduces the Latently Invertible Autoencoder (LIA), a framework that embeds an invertible mapping between disentangled latent spaces and is trained in two stages, allowing accurate reconstruction and efficient inference. Empirical results on FFHQ and LSUN demonstrate improved reconstruction quality and versatile image-editing capabilities, while ablations show the necessity of the disentangled $\bm y$-space and the invertible bridge. The approach offers a practical path to GAN inversion and editing for real images, with implications for data augmentation, few-shot learning, and 3D vision tasks.

Abstract

Generative Adversarial Networks (GANs) play an increasingly important role in machine learning. However, one fundamental issue hinders their practical application: they lack the capability to encode real-world samples. The conventional way of addressing this issue is to learn an encoder for a GAN via a Variational Auto-Encoder (VAE). In this paper, we show that the entanglement of the latent space in the VAE/GAN framework poses the main challenge for encoder learning. To address the entanglement issue and enable inference in GANs, we propose a novel algorithm named Latently Invertible Autoencoder (LIA). In LIA, an invertible network and its inverse mapping are symmetrically embedded in the latent space of a VAE. The decoder of LIA is first trained as a standard GAN with the invertible network, and then the partial encoder is learned from a disentangled autoencoder by detaching the invertible network from LIA, thus avoiding the entanglement problem caused by the random latent space. Experiments conducted on the FFHQ face dataset and three LSUN datasets validate the effectiveness of LIA/GAN.
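
The invertible network in the latent space can be illustrated with a minimal sketch. The abstract does not specify the architecture, so the example below assumes a single additive coupling step, a common building block for invertible networks; the names `shift`, `forward`, `inverse`, and the random `weights` are hypothetical and stand in for a learned component. It only shows that such a map can be undone exactly, which is the property that allows a network and its inverse to be placed symmetrically in the latent space.

```python
import math
import random

def shift(half, weights):
    # Hypothetical stand-in for a learned sub-network: tanh of a linear map.
    return [math.tanh(sum(w * h for w, h in zip(row, half))) for row in weights]

def forward(z, weights):
    # Additive coupling (assumed design): keep the first half of the vector,
    # shift the second half by a function of the first half.
    k = len(z) // 2
    z1, z2 = z[:k], z[k:]
    return z1 + [b + s for b, s in zip(z2, shift(z1, weights))]

def inverse(y, weights):
    # Exact inverse: the first half is unchanged, so the same shift can be
    # recomputed from it and subtracted from the second half.
    k = len(y) // 2
    y1, y2 = y[:k], y[k:]
    return y1 + [b - s for b, s in zip(y2, shift(y1, weights))]

random.seed(0)
dim = 4  # must be even for this sketch
weights = [[random.gauss(0.0, 1.0) for _ in range(dim // 2)] for _ in range(dim // 2)]
z = [random.gauss(0.0, 1.0) for _ in range(dim)]

y = forward(z, weights)       # one latent code mapped to the other space
z_back = inverse(y, weights)  # mapped back through the inverse

# Round-trip error should be at floating-point level.
max_error = max(abs(a - b) for a, b in zip(z, z_back))
print(max_error < 1e-12)
```

Because the inverse is available in closed form, no separate network has to be trained to undo the mapping. Which direction corresponds to the decoder side and which to the encoder side in LIA is not stated in the abstract and is left open here.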
