WeditGAN: Few-Shot Image Generation via Latent Space Relocation
Yuxuan Duan, Li Niu, Yan Hong, Liqing Zhang
TL;DR
WeditGAN tackles few-shot image generation by transferring a pretrained StyleGAN through latent-space relocation using a fixed offset $\Delta w$, relocating the target latent space $W_{\text{tgt}}^+$ from the source $W_{\text{src}}^+$. By freezing the mapping and synthesis networks and only learning $\Delta w$, the approach preserves source-domain diversity while achieving target-domain fidelity; it further extends to layer-wise $W^+$ with AlphaModules for per-$w_{src}$ editing and adds optional perpendicular regularization and contrastive-based regularization to improve robustness. The method shows state-of-the-art performance across eight source/target pairs in 10-shot settings, offering competitive fidelity and high diversity without extensive parameter updates. The work demonstrates a simple, effective transfer mechanism that leverages the geometry of StyleGAN latent spaces, with open-source code and clear guidance for extensions in per-layer relocation, editing intensity, and orthogonality to reg-based methods.
Abstract
In few-shot image generation, directly training GAN models on just a handful of images faces the risk of overfitting. A popular solution is to transfer the models pretrained on large source domains to small target ones. In this work, we introduce WeditGAN, which realizes model transfer by editing the intermediate latent codes $w$ in StyleGANs with learned constant offsets ($Δw$), discovering and constructing target latent spaces via simply relocating the distribution of source latent spaces. The established one-to-one mapping between latent spaces can naturally prevents mode collapse and overfitting. Besides, we also propose variants of WeditGAN to further enhance the relocation process by regularizing the direction or finetuning the intensity of $Δw$. Experiments on a collection of widely used source/target datasets manifest the capability of WeditGAN in generating realistic and diverse images, which is simple yet highly effective in the research area of few-shot image generation. Codes are available at https://github.com/Ldhlwh/WeditGAN.
