FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation
Pengchong Qiao, Lei Shang, Chang Liu, Baigui Sun, Xiangyang Ji, Jie Chen
TL;DR
FaceChain-SuDe addresses the attribute-missing problem in one-shot subject-driven generation by modeling a subject as a derived class of its semantic category and introducing Subject-Derivation Regularization (SuDe). SuDe couples a private-attribute reconstruction loss with a category-inheritance loss that leverages the implicit diffusion classifier to encourage generated images to semantically belong to the subject’s category, while preserving subject fidelity. The method is plug-and-play and improves attribute alignment (BLIP-T) across DreamBooth, Custom Diffusion, and ViCo on multiple SD backbones, with stability ensured by a loss-truncation strategy. This approach broadens practical one-shot personalization by enabling more imaginative attribute-related generations without sacrificing subject identity.
Abstract
Subject-driven generation has garnered significant interest recently due to its ability to personalize text-to-image generation. Typical works focus on learning the new subject's private attributes. However, an important fact has not been taken seriously that a subject is not an isolated new concept but should be a specialization of a certain category in the pre-trained model. This results in the subject failing to comprehensively inherit the attributes in its category, causing poor attribute-related generations. In this paper, motivated by object-oriented programming, we model the subject as a derived class whose base class is its semantic category. This modeling enables the subject to inherit public attributes from its category while learning its private attributes from the user-provided example. Specifically, we propose a plug-and-play method, Subject-Derived regularization (SuDe). It constructs the base-derived class modeling by constraining the subject-driven generated images to semantically belong to the subject's category. Extensive experiments under three baselines and two backbones on various subjects show that our SuDe enables imaginative attribute-related generations while maintaining subject fidelity. Codes will be open sourced soon at FaceChain (https://github.com/modelscope/facechain).
