Character-Adapter: Prompt-Guided Region Control for High-Fidelity Character Customization
Yuhang Ma, Wenting Xu, Jiji Tang, Qinfeng Jin, Rongsheng Zhang, Zeng Zhao, Changjie Fan, Zhipeng Hu
TL;DR
Character-Adapter addresses the challenge of maintaining high-fidelity character details in image synthesis by introducing a plug-and-play framework that uses prompt-guided segmentation to localize character regions and dynamic region-level adapters to preserve region-specific features. By leveraging cross-attention cues within diffusion models and applying region-specific fusion, it enables both single- and multi-character generation without additional training. The approach achieves state-of-the-art zero-shot character consistency—demonstrated by substantial improvements in CLIP-I and DINO-I metrics—while maintaining strong text–image alignment and computational efficiency. This method enhances practical applicability for storytelling, portrait design, and character-centric editing by offering flexible, region-aware control and broad compatibility with existing editing tools.
Abstract
Customized image generation, which seeks to synthesize images with consistent characters, holds significant relevance for applications such as storytelling, portrait generation, and character design. However, previous approaches have encountered challenges in preserving characters with high-fidelity consistency due to inadequate feature extraction and concept confusion of reference characters. Therefore, we propose Character-Adapter, a plug-and-play framework designed to generate images that preserve the details of reference characters, ensuring high-fidelity consistency. Character-Adapter employs prompt-guided segmentation to ensure fine-grained regional features of reference characters and dynamic region-level adapters to mitigate concept confusion. Extensive experiments are conducted to validate the effectiveness of Character-Adapter. Both quantitative and qualitative results demonstrate that Character-Adapter achieves the state-of-the-art performance of consistent character generation, with an improvement of 24.8% compared with other methods. Our code will be released at https://github.com/Character-Adapter/Character-Adapter.
