Unsupervised Image-to-Image Translation Networks

Ming-Yu Liu; Thomas Breuel; Jan Kautz

Unsupervised Image-to-Image Translation Networks

Ming-Yu Liu, Thomas Breuel, Jan Kautz

TL;DR

The paper tackles unsupervised image-to-image translation by introducing UNIT, a framework that enforces a shared latent space across two domains via weight-sharing between encoders and generators paired with VAE-GANs. The approach yields bidirectional translation streams and cycle-consistent reconstructions through a combined loss of VAE, GAN, and cycle-consistency terms, learned with alternating optimization. Empirical results demonstrate strong performance across diverse tasks, including map-satellite translation, street scenes, animal and face translations, and unsupervised domain adaptation, with ablations confirming the benefit of the shared latent structure. The work situates itself relative to CoGAN and CycleGAN, arguing that the shared-latent-space constraint naturally induces cycle-consistency while enabling robust, unpaired cross-domain translation. Overall, UNIT offers a versatile, end-to-end framework for cross-domain image translation without paired data and highlights avenues for improved multimodality and stability.

Abstract

Unsupervised image-to-image translation aims at learning a joint distribution of images in different domains by using images from the marginal distributions in individual domains. Since there exists an infinite set of joint distributions that can arrive the given marginal distributions, one could infer nothing about the joint distribution from the marginal distributions without additional assumptions. To address the problem, we make a shared-latent space assumption and propose an unsupervised image-to-image translation framework based on Coupled GANs. We compare the proposed framework with competing approaches and present high quality image translation results on various challenging unsupervised image translation tasks, including street scene image translation, animal image translation, and face image translation. We also apply the proposed framework to domain adaptation and achieve state-of-the-art performance on benchmark datasets. Code and additional results are available in https://github.com/mingyuliutw/unit .

Unsupervised Image-to-Image Translation Networks

TL;DR

Abstract

Unsupervised Image-to-Image Translation Networks

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)