Good Semi-supervised Learning that Requires a Bad GAN
Zihang Dai, Zhilin Yang, Fan Yang, William W. Cohen, Ruslan Salakhutdinov
TL;DR
<3-5 sentence high-level summary>This work analyzes GAN-based semi-supervised learning and reveals that a perfect generator provides no SSL gain, while a carefully designed complement generator can place decision boundaries in low-density regions of the feature space. It introduces a practical framework that (i) increases generator entropy, (ii) generates low-density samples, and (iii) adds a conditional-entropy term to enforce strong true-fake beliefs, collectively approximating a KL divergence minimization to a complement distribution $p^*(x)$. The approach yields substantial empirical gains on MNIST, SVHN, and CIFAR-10 with small discriminators, achieving state-of-the-art single-model results and clarifying the trade-offs between generator quality and SSL performance. These insights offer a principled path to robust SSL with GANs and have practical implications for designing discriminator-guided generators in semi-supervised visual tasks.
Abstract
Semi-supervised learning methods based on generative adversarial networks (GANs) obtained strong empirical results, but it is not clear 1) how the discriminator benefits from joint training with a generator, and 2) why good semi-supervised classification performance and a good generator cannot be obtained at the same time. Theoretically, we show that given the discriminator objective, good semisupervised learning indeed requires a bad generator, and propose the definition of a preferred generator. Empirically, we derive a novel formulation based on our analysis that substantially improves over feature matching GANs, obtaining state-of-the-art results on multiple benchmark datasets.
