A Dual-stage Prompt-driven Privacy-preserving Paradigm for Person Re-Identification

Ruolin Li; Min Liu; Yuan Bian; Zhaoyang Li; Yuzhen Li; Xueping Wang; Yaonan Wang

A Dual-stage Prompt-driven Privacy-preserving Paradigm for Person Re-Identification

Ruolin Li, Min Liu, Yuan Bian, Zhaoyang Li, Yuzhen Li, Xueping Wang, Yaonan Wang

TL;DR

This work tackles privacy concerns in person re-identification by replacing real imagery with a privacy-preserving synthetic dataset generated via a text-driven diffusion model. It introduces DPPP, a dual-stage pipeline: Stage 1 uses rich prompts to synthesize GenePerson with diverse appearances and scenes, and Stage 2 employs a Prompt-driven Disentanglement Mechanism (PDM) that learns style and content pseudo-words to extract domain-invariant content features through CLIP-based contrastive learning. The approach yields state-of-the-art cross-domain generalization on Market-1501 and DukeMTMC-reID, outperforming both real and other synthetic datasets; its best results come from training GenePerson with PDM. By enabling end-to-end virtual data generation and disentangled, content-focused representation learning, the method reduces privacy risks while preserving strong Re-ID performance and offers a scalable path for privacy-safe cross-domain recognition.

Abstract

With growing concerns over data privacy, researchers have started using virtual data as an alternative to sensitive real-world images for training person re-identification (Re-ID) models. However, existing virtual datasets produced by game engines still face challenges such as complex construction and poor domain generalization, making them difficult to apply in real scenarios. To address these challenges, we propose a Dual-stage Prompt-driven Privacy-preserving Paradigm (DPPP). In the first stage, we generate rich prompts incorporating multi-dimensional attributes such as pedestrian appearance, illumination, and viewpoint that drive the diffusion model to synthesize diverse data end-to-end, building a large-scale virtual dataset named GenePerson with 130,519 images of 6,641 identities. In the second stage, we propose a Prompt-driven Disentanglement Mechanism (PDM) to learn domain-invariant generalization features. With the aid of contrastive learning, we employ two textual inversion networks to map images into pseudo-words representing style and content, respectively, thereby constructing style-disentangled content prompts to guide the model in learning domain-invariant content features at the image level. Experiments demonstrate that models trained on GenePerson with PDM achieve state-of-the-art generalization performance, surpassing those on popular real and virtual Re-ID datasets.

A Dual-stage Prompt-driven Privacy-preserving Paradigm for Person Re-Identification

TL;DR

Abstract

A Dual-stage Prompt-driven Privacy-preserving Paradigm for Person Re-Identification

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)