Table of Contents
Fetching ...

Rethinking Clothes Changing Person ReID: Conflicts, Synthesis, and Optimization

Junjie Li, Guanshuo Wang, Fufu Yu, Yichao Yan, Qiong Jia, Shouhong Ding, Xingdong Sheng, Yunhui Liu, Xiaokang Yang

TL;DR

This work tackles CC-ReID by revealing intrinsic conflicts between clothing-invariant identity cues and clothing-dependent features, and by proposing a two-pronged solution: high-fidelity clothes-varying data synthesis via Clothes-Changing Diffusion (CC-Diffusion) and a multi-objective optimization (MOO) framework to balance competing objectives. The CC-Diffusion model generates controllable, identity-consistent clothes-varying images to augment CC data, while the CC-ReID learning is reformulated into three objectives: $\mathcal{L}_{id}$, $\mathcal{L}_{sc}$, and $\mathcal{L}_{cc}$, optimized through gradient-based Pareto methods and guided by human preference vectors to achieve desired trade-offs. Key contributions include exposing the objective conflict in CC-ReID, introducing high-quality synthetic data, and delivering a model-agnostic MOO solution that yields Pareto-optimal and practically balanced performance under both CC and standard ReID protocols. The approach demonstrates significant CC improvements with controlled SC performance across PRCC and CCVID datasets, highlighting the practical impact of combining data synthesis with principled multi-objective optimization in CC-ReID.

Abstract

Clothes-changing person re-identification (CC-ReID) aims to retrieve images of the same person wearing different outfits. Mainstream researches focus on designing advanced model structures and strategies to capture identity information independent of clothing. However, the same-clothes discrimination as the standard ReID learning objective in CC-ReID is persistently ignored in previous researches. In this study, we dive into the relationship between standard and clothes-changing~(CC) learning objectives, and bring the inner conflicts between these two objectives to the fore. We try to magnify the proportion of CC training pairs by supplementing high-fidelity clothes-varying synthesis, produced by our proposed Clothes-Changing Diffusion model. By incorporating the synthetic images into CC-ReID model training, we observe a significant improvement under CC protocol. However, such improvement sacrifices the performance under the standard protocol, caused by the inner conflict between standard and CC. For conflict mitigation, we decouple these objectives and re-formulate CC-ReID learning as a multi-objective optimization (MOO) problem. By effectively regularizing the gradient curvature across multiple objectives and introducing preference restrictions, our MOO solution surpasses the single-task training paradigm. Our framework is model-agnostic, and demonstrates superior performance under both CC and standard ReID protocols.

Rethinking Clothes Changing Person ReID: Conflicts, Synthesis, and Optimization

TL;DR

This work tackles CC-ReID by revealing intrinsic conflicts between clothing-invariant identity cues and clothing-dependent features, and by proposing a two-pronged solution: high-fidelity clothes-varying data synthesis via Clothes-Changing Diffusion (CC-Diffusion) and a multi-objective optimization (MOO) framework to balance competing objectives. The CC-Diffusion model generates controllable, identity-consistent clothes-varying images to augment CC data, while the CC-ReID learning is reformulated into three objectives: , , and , optimized through gradient-based Pareto methods and guided by human preference vectors to achieve desired trade-offs. Key contributions include exposing the objective conflict in CC-ReID, introducing high-quality synthetic data, and delivering a model-agnostic MOO solution that yields Pareto-optimal and practically balanced performance under both CC and standard ReID protocols. The approach demonstrates significant CC improvements with controlled SC performance across PRCC and CCVID datasets, highlighting the practical impact of combining data synthesis with principled multi-objective optimization in CC-ReID.

Abstract

Clothes-changing person re-identification (CC-ReID) aims to retrieve images of the same person wearing different outfits. Mainstream researches focus on designing advanced model structures and strategies to capture identity information independent of clothing. However, the same-clothes discrimination as the standard ReID learning objective in CC-ReID is persistently ignored in previous researches. In this study, we dive into the relationship between standard and clothes-changing~(CC) learning objectives, and bring the inner conflicts between these two objectives to the fore. We try to magnify the proportion of CC training pairs by supplementing high-fidelity clothes-varying synthesis, produced by our proposed Clothes-Changing Diffusion model. By incorporating the synthetic images into CC-ReID model training, we observe a significant improvement under CC protocol. However, such improvement sacrifices the performance under the standard protocol, caused by the inner conflict between standard and CC. For conflict mitigation, we decouple these objectives and re-formulate CC-ReID learning as a multi-objective optimization (MOO) problem. By effectively regularizing the gradient curvature across multiple objectives and introducing preference restrictions, our MOO solution surpasses the single-task training paradigm. Our framework is model-agnostic, and demonstrates superior performance under both CC and standard ReID protocols.
Paper Structure (11 sections, 11 equations, 4 figures, 4 tables)

This paper contains 11 sections, 11 equations, 4 figures, 4 tables.

Figures (4)

  • Figure 1: The overview of our framework and the pipeline of generating synthesis with Clothes-Changing Diffusion model. The learning process of standard and CC pairs are decomposed to jointly optimize for pareto-optimal solutions.
  • Figure 2: Synthetic results on PRCC, every 4 persons is grouped together. The first row contains original images $I$, the second row refers to clothes images $I'$. The other rows represent clothes-varying synthesis $I_{CC}$ by swapping clothing in the group.
  • Figure 3: Illustration of optimization with preference. (a): The whole objective space is divided by a set of planes. (b): The shift of pareto front caused by directly weighting objectives in formulation \ref{['eq:opt_goal']}. Our method would restrict solution to a chosen sub-space.
  • Figure 4: Comparison between linear scalarization and gradient-based optimization with preference. Rhombus points denote results of linear scalarization with a fixed weights for ($\mathcal{L}_{sc}, \mathcal{L}_{cc}$). Circular points refer to GBO methods with a selected preference representing the angle in the plane $\mathcal{L}_{sc}, \mathcal{L}_{cc}$ (i.e. GBO $45^\circ$ means the vector is ($\frac{\sqrt{2}}{2}, \frac{\sqrt{2}}{2}$)).