SwipeGANSpace: Swipe-to-Compare Image Generation via Efficient Latent Space Exploration
Yuto Nakashima, Mingzhe Yang, Yukino Baba
TL;DR
This work addresses the challenge of generating user-preferred images from the high-dimensional StyleGAN latent space by introducing a swipe-to-compare interface. It combines PCA-based latent-space reduction (to form a manageable subspace) with preferential Bayesian optimization and a multi-armed bandit to dynamically identify the most relevant latent dimensions to explore, mapping results back to the full latent space for image synthesis. Through simulation and user experiments, the method demonstrates superior efficiency in converging to user-preferred images and reveals that user preferences can shift during comparisons, which the approach accommodates through adaptive exploration. The results support a smartphone-friendly, human-centered workflow for personalized image generation, with future work focusing on faster search and enhanced visualization of latent-manipulation changes.
Abstract
Generating preferred images using generative adversarial networks (GANs) is challenging owing to the high-dimensional nature of latent space. In this study, we propose a novel approach that uses simple user-swipe interactions to generate preferred images for users. To effectively explore the latent space with only swipe interactions, we apply principal component analysis to the latent space of the StyleGAN, creating meaningful subspaces. We use a multi-armed bandit algorithm to decide the dimensions to explore, focusing on the preferences of the user. Experiments show that our method is more efficient in generating preferred images than the baseline methods. Furthermore, changes in preferred images during image generation or the display of entirely different image styles were observed to provide new inspirations, subsequently altering user preferences. This highlights the dynamic nature of user preferences, which our proposed approach recognizes and enhances.
