Generalization error property of infoGAN for two-layer neural network
Mahmud Hasan, Mathias Muia
TL;DR
This work analyzes the generalization of infoGAN in a two-layer neural network by examining the difference between empirical and population objective functions. It derives finite-sample generalization bounds using Rademacher complexity for the discriminator, generator, and their composition under Lipschitz and non-decreasing activations, within a regularized objective that omits a latent code. The main contributions include explicit bounds for two-layer networks, proofs for both activation regimes, and corollaries translating these into practical discrepancy bounds, showing the empirical-generalization gap shrinks with larger sample sizes. The results highlight how network complexity and activation properties govern generalization in adversarial settings and point to future work on tightening bounds and exploring lower bounds.
Abstract
Information Maximizing Generative Adversarial Network (infoGAN) can be understood as a minimax problem involving two neural networks: discriminators and generators with mutual information functions. The infoGAN incorporates various components, including latent variables, mutual information, and objective function. This research demonstrates the Generalization error property of infoGAN as the discriminator and generator sample size approaches infinity. This research explores the generalization error property of InfoGAN as the sample sizes of the discriminator and generator approach infinity. To establish this property, the study considers the difference between the empirical and population versions of the objective function. The error bound is derived from the Rademacher complexity of the discriminator and generator function classes. Additionally, the bound is proven for a two-layer network, where both the discriminator and generator utilize Lipschitz and non-decreasing activation functions.
