Adv-CPG: A Customized Portrait Generation Framework with Facial Adversarial Attacks
Junying Wang, Hongyuan Zhang, Yuan Yuan
TL;DR
Adv-CPG tackles the privacy risks of customized portrait generation by merging facial adversarial attacks with text-guided synthesis. It introduces a three-component framework—ID Encryptor En1, Encryption Enhancer En2, and Multi-Modal Image Customizer MMIC—and a two-stage generation pipeline that first protects privacy and then enables fine-grained portrait customization. Training focuses on MMIC and En1, while inference uses a delayed conditioning scheme to balance identity protection with semantic control. Empirical results show strong black-box attack performance against multiple FR models and commercial APIs, while still delivering high-fidelity, personalized portraits, highlighting a practical approach to safeguarding facial privacy in AI-enabled portrait tools.
Abstract
Recent Customized Portrait Generation (CPG) methods, taking a facial image and a textual prompt as inputs, have attracted substantial attention. Although these methods generate high-fidelity portraits, they fail to prevent the generated portraits from being tracked and misused by malicious face recognition systems. To address this, this paper proposes a Customized Portrait Generation framework with facial Adversarial attacks (Adv-CPG). Specifically, to achieve facial privacy protection, we devise a lightweight local ID encryptor and an encryption enhancer. They implement progressive double-layer encryption protection by directly injecting the target identity and adding additional identity guidance, respectively. Furthermore, to accomplish fine-grained and personalized portrait generation, we develop a multi-modal image customizer capable of generating controlled fine-grained facial features. To the best of our knowledge, Adv-CPG is the first study that introduces facial adversarial attacks into CPG. Extensive experiments demonstrate the superiority of Adv-CPG, e.g., the average attack success rate of the proposed Adv-CPG is 28.1% and 2.86% higher compared to the SOTA noise-based attack methods and unconstrained attack methods, respectively.
