LIPE: Learning Personalized Identity Prior for Non-rigid Image Editing
Aoyang Liu, Qingnan Fan, Shuai Qin, Hong Gu, Yansong Tang
TL;DR
This work tackles the challenge of non-rigid image editing while preserving subject identity by learning a personalized identity prior from only a few reference images. It introduces a two-stage LIPE framework: (1) data-augmented learning of a subject-specific prior by fine-tuning a diffusion model on attention—updates limited to the attention layers, and (2) a non-rigid editing mechanism called NIMA that uses identity-aware cross-attention masks to guide latent blending during denoising. The authors also present LIPE, a dedicated dataset spanning objects, animals, and humans, and demonstrate through qualitative and quantitative evaluations that LIPE outperforms strong baselines in identity preservation, background fidelity, and prompt alignment for non-rigid edits. The approach offers a practical path toward controllable, identity-consistent image editing with minimal target subject data, supported by a dataset and comprehensive analyses.
Abstract
Although recent years have witnessed significant advancements in image editing thanks to the remarkable progress of text-to-image diffusion models, the problem of non-rigid image editing still presents its complexities and challenges. Existing methods often fail to achieve consistent results due to the absence of unique identity characteristics. Thus, learning a personalized identity prior might help with consistency in the edited results. In this paper, we explore a novel task: learning the personalized identity prior for text-based non-rigid image editing. To address the problems in jointly learning prior and editing the image, we present LIPE, a two-stage framework designed to customize the generative model utilizing a limited set of images of the same subject, and subsequently employ the model with learned prior for non-rigid image editing. Experimental results demonstrate the advantages of our approach in various editing scenarios over past related leading methods in qualitative and quantitative ways.
