Unleashing the Potential of Pre-Trained Diffusion Models for Generalizable Person Re-Identification
Jiachen Li, Xiaojin Gong
TL;DR
The paper tackles domain-generalizable person re-identification (DG Re-ID) by addressing shortcut learning in discriminative setups. It introduces DCAC, a diffusion model-assisted representation learning framework that couples a CLIP-based Re-ID backbone with a pre-trained diffusion model via a correlation-aware conditioning scheme using ID-wise prompts and dark knowledge from classification logits. The diffusion model is adapted with LoRA adapters to balance preserving pre-trained knowledge and enabling downstream adaptation, with gradients flowing back to the Re-ID model to improve generalization. Across single-source and multi-source DG Re-ID benchmarks, DCAC achieves state-of-the-art or competitive results and is supported by extensive ablations validating the conditioning strategy, diffusion assistance, and efficiency advantages.
Abstract
Domain-generalizable re-identification (DG Re-ID) aims to train a model on one or more source domains and evaluate its performance on unseen target domains, a task that has attracted growing attention due to its practical relevance. While numerous methods have been proposed, most rely on discriminative or contrastive learning frameworks to learn generalizable feature representations. However, these approaches often fail to mitigate shortcut learning, leading to suboptimal performance. In this work, we propose a novel method called diffusion model-assisted representation learning with a correlation-aware conditioning scheme (DCAC) to enhance DG Re-ID. Our method integrates a discriminative and contrastive Re-ID model with a pre-trained diffusion model through a correlation-aware conditioning scheme. By incorporating ID classification probabilities generated from the Re-ID model with a set of learnable ID-wise prompts, the conditioning scheme injects dark knowledge that captures ID correlations to guide the diffusion process. Simultaneously, feedback from the diffusion model is back-propagated through the conditioning scheme to the Re-ID model, effectively improving the generalization capability of Re-ID features. Extensive experiments on both single-source and multi-source DG Re-ID tasks demonstrate that our method achieves state-of-the-art performance. Comprehensive ablation studies further validate the effectiveness of the proposed approach, providing insights into its robustness. Codes will be available at https://github.com/RikoLi/DCAC.
