Nickel and Diming Your GAN: A Dual-Method Approach to Enhancing GAN Efficiency via Knowledge Distillation
Sangyeop Yeo, Yoojin Jang, Jaejun Yoo
TL;DR
This paper tackles deploying GANs in resource‑constrained settings by introducing two complementary methods, DiME and NICKEL. DiME performs distribution matching in embedding spaces via Maximum Mean Discrepancy using foundation kernels to align $G^T$ and $G^S$, while NICKEL enhances stability by transferring knowledge through the discriminator ($D^S$) and its feedback to the student generator ($G^S$). On StyleGAN2 with FFHQ, the approach yields state‑of‑the‑art compression results (e.g., FID up to 15.93 at 98.92% compression and 29.38 at 99.69%), maintaining perceptual quality at extreme pruning. Collectively, these methods significantly reduce GAN compute while preserving high image fidelity, enabling practical deployment in limited‑resource environments.
Abstract
In this paper, we address the challenge of compressing generative adversarial networks (GANs) for deployment in resource-constrained environments by proposing two novel methodologies: Distribution Matching for Efficient compression (DiME) and Network Interactive Compression via Knowledge Exchange and Learning (NICKEL). DiME employs foundation models as embedding kernels for efficient distribution matching, leveraging maximum mean discrepancy to facilitate effective knowledge distillation. Simultaneously, NICKEL employs an interactive compression method that enhances the communication between the student generator and discriminator, achieving a balanced and stable compression process. Our comprehensive evaluation on the StyleGAN2 architecture with the FFHQ dataset shows the effectiveness of our approach, with NICKEL & DiME achieving FID scores of 10.45 and 15.93 at compression rates of 95.73% and 98.92%, respectively. Remarkably, our methods sustain generative quality even at an extreme compression rate of 99.69%, surpassing the previous state-of-the-art performance by a large margin. These findings not only demonstrate our methodologies' capacity to significantly lower GANs' computational demands but also pave the way for deploying high-quality GAN models in settings with limited resources. Our code will be released soon.
