AgeBooth: Controllable Facial Aging and Rejuvenation via Diffusion Models
Shihao Zhu, Bohan Cao, Ziheng Ouyang, Zhen Li, Peng-Tao Jiang, Qibin Hou
TL;DR
AgeBooth tackles controllable facial aging with identity preservation in diffusion-based text-to-image generation. It introduces a two-pronged approach: few-shot age-specific finetuning with LoRA adapters to embed aging concepts, and a training-free LoRA and prompt fusion framework using SVD-based fusion (SVDMix) plus prompt interpolation to achieve smooth, continuous age transitions while maintaining identity via an ID adapter. The method yields accurate age control and high-quality, identity-consistent portraits across ages from a single reference image, outperforming prior editing-based approaches in both age accuracy (via MiVOLO) and aesthetics (via LAION predictor). Extensive ablations and cross-backbone evaluations demonstrate the robustness of the aging interpolation and the practical, plug-in nature of AgeBooth for personalized diffusion-based generation.
Abstract
Recent diffusion model research focuses on generating identity-consistent images from a reference photo, but they struggle to accurately control age while preserving identity, and fine-tuning such models often requires costly paired images across ages. In this paper, we propose AgeBooth, a novel age-specific finetuning approach that can effectively enhance the age control capability of adapterbased identity personalization models without the need for expensive age-varied datasets. To reduce dependence on a large amount of age-labeled data, we exploit the linear nature of aging by introducing age-conditioned prompt blending and an age-specific LoRA fusion strategy that leverages SVDMix, a matrix fusion technique. These techniques enable high-quality generation of intermediate-age portraits. Our AgeBooth produces realistic and identity-consistent face images across different ages from a single reference image. Experiments show that AgeBooth achieves superior age control and visual quality compared to previous state-of-the-art editing-based methods.
