Generative Active Learning for Image Synthesis Personalization
Xulu Zhang, Wengyu Zhang, Xiao-Yong Wei, Jinlin Wu, Zhaoxiang Zhang, Zhen Lei, Qing Li
TL;DR
This work explores applying active learning to generative image synthesis personalization (ISP) by converting open-ended querying into a semi-open, anchor-direction framework. It introduces a distribution-based uncertainty sampling strategy and a balancing mechanism to navigate exploitation-exploration in GAL, enabling efficient use of synthetic samples. Empirical results on style- and object-driven ISP show that GAL with uncertainty and balance can match or surpass state-of-the-art methods, including some closed-source approaches, while leveraging open-source diffusion models. The approach reduces annotation burden and demonstrates a practical path toward data-efficient, personalized image synthesis with generative models.
Abstract
This paper presents a pilot study that explores the application of active learning, traditionally studied in the context of discriminative models, to generative models. We specifically focus on image synthesis personalization tasks. The primary challenge in conducting active learning on generative models lies in the open-ended nature of querying, which differs from the closed form of querying in discriminative models that typically target a single concept. We introduce the concept of anchor directions to transform the querying process into a semi-open problem. We propose a direction-based uncertainty sampling strategy to enable generative active learning and tackle the exploitation-exploration dilemma. Extensive experiments are conducted to validate the effectiveness of our approach, demonstrating that an open-source model can achieve superior performance compared to closed-source models developed by large companies, such as Google's StyleDrop. The source code is available at https://github.com/zhangxulu1996/GAL4Personalization.
