Assessing Open-world Forgetting in Generative Image Model Customization
Héctor Laria, Alex Gomez-Villa, Kai Wang, Bogdan Raducanu, Joost van de Weijer
TL;DR
This work defines open-world forgetting in diffusion-model customization, demonstrating that even tiny fine-tuning updates can cause substantial semantic and appearance drift across a model's broad latent space. It develops evaluation methods based on zero-shot classification for semantic drift and distribution-based color metrics, notably the Color Drift Index (CDI), for appearance drift, and introduces a Drift Correction loss (\mathcal{L}_{DC}) to preserve prior capabilities while learning new concepts. Empirical results show significant semantic accuracy drops (worst-case >60%) without mitigation, which are substantially reduced by the proposed method, with DRIFT correction also preserving diversity and improving user-perceived fidelity. The findings highlight the need to account for open-world forgetting in model customization and offer a practical, reproducible framework and mitigation strategy to enhance the reliability of personalized diffusion models in real-world applications.
Abstract
Recent advances in diffusion models have significantly enhanced image generation capabilities. However, customizing these models with new classes often leads to unintended consequences that compromise their reliability. We introduce the concept of open-world forgetting to characterize the vast scope of these unintended alterations. Our work presents the first systematic investigation into open-world forgetting in diffusion models, focusing on semantic and appearance drift of representations. Using zero-shot classification, we demonstrate that even minor model adaptations can lead to significant semantic drift affecting areas far beyond newly introduced concepts, with accuracy drops of up to 60% on previously learned concepts. Our analysis of appearance drift reveals substantial changes in texture and color distributions of generated content. To address these issues, we propose a functional regularization strategy that effectively preserves original capabilities while accommodating new concepts. Through extensive experiments across multiple datasets and evaluation metrics, we demonstrate that our approach significantly reduces both semantic and appearance drift. Our study highlights the importance of considering open-world forgetting in future research on model customization and finetuning methods.
