CCUP: A Controllable Synthetic Data Generation Pipeline for Pretraining Cloth-Changing Person Re-Identification Models
Yujian Zhao, Chengru Wu, Yinong Xu, Xuanzheng Du, Ruiyu Li, Guanglin Niu
TL;DR
Cloth-changing Re-Identification is hampered by limited labeled data across outfits. The authors propose CCUP, a large-scale, controllable synthetic CC-ReID data generation pipeline built on MakeHuman and Unreal Engine, producing 6,000 IDs, 1,179,976 images, 100 cameras, and an average of 26.5 outfits per identity. They validate a pretrain-finetune framework that pretrains TransReID and FIRe^2 on CCUP and finetunes on PRCC, VC-Clothes, NKUP, achieving state-of-the-art results and demonstrating improved cloth-irrelevant feature learning. Visualizations indicate CCUP pretraining shifts attention to body regions and key body parts, supporting improved generalization to cloth-changing scenarios. Overall, CCUP provides a cost-effective, scalable path to robust CC-ReID models and clearer avenues for future improvements in cloth-irrelevant representation learning.
Abstract
Cloth-changing person re-identification (CC-ReID), also known as Long-Term Person Re-Identification (LT-ReID) is a critical and challenging research topic in computer vision that has recently garnered significant attention. However, due to the high cost of constructing CC-ReID data, the existing data-driven models are hard to train efficiently on limited data, causing overfitting issue. To address this challenge, we propose a low-cost and efficient pipeline for generating controllable and high-quality synthetic data simulating the surveillance of real scenarios specific to the CC-ReID task. Particularly, we construct a new self-annotated CC-ReID dataset named Cloth-Changing Unreal Person (CCUP), containing 6,000 IDs, 1,179,976 images, 100 cameras, and 26.5 outfits per individual. Based on this large-scale dataset, we introduce an effective and scalable pretrain-finetune framework for enhancing the generalization capabilities of the traditional CC-ReID models. The extensive experiments demonstrate that two typical models namely TransReID and FIRe^2, when integrated into our framework, outperform other state-of-the-art models after pretraining on CCUP and finetuning on the benchmarks such as PRCC, VC-Clothes and NKUP. The CCUP is available at: https://github.com/yjzhao1019/CCUP.
