Learning to Discover Cross-Domain Relations with Generative Adversarial Networks
Taeksoo Kim, Moonsu Cha, Hyunsoo Kim, Jung Kwon Lee, Jiwon Kim
TL;DR
This work tackles unsupervised discovery of cross-domain relations from unpaired image datasets by proposing DiscoGAN, a dual-GAN framework with reconstruction losses that enforce bidirectional, bijective mappings between two domains. By coupling two GANs and introducing two cycle-consistency reconstruction terms, DiscoGAN avoids mode collapse and achieves robust, invertible translations across diverse domain pairs. The approach is validated on toy and real-domain tasks (e.g., car, face, chair, edges-to-photos, handbags↔shoes), demonstrating accurate, attribute-preserving cross-domain mappings without explicit pair labels. Overall, DiscoGAN enables natural style transfer and relation discovery in settings where paired data is scarce or unavailable, with broad applicability to image-to-image translation and multi-domain learning.
Abstract
While humans easily recognize relations between data from different domains without any supervision, learning to automatically discover them is in general very challenging and needs many ground-truth pairs that illustrate the relations. To avoid costly pairing, we address the task of discovering cross-domain relations given unpaired data. We propose a method based on generative adversarial networks that learns to discover relations between different domains (DiscoGAN). Using the discovered relations, our proposed network successfully transfers style from one domain to another while preserving key attributes such as orientation and face identity. Source code for official implementation is publicly available https://github.com/SKTBrain/DiscoGAN
