Synthetic-To-Real Video Person Re-ID
Xiangqun Zhang, Wei Feng, Ruize Han, Likai Wang, Linqi Song, Junhui Hou
TL;DR
This work tackles cross-domain video-based person Re-ID by leveraging synthetic data to train models that generalize to real-world videos. The authors propose a framework that combines multi-level domain-invariant feature learning with a mean-teacher consistency scheme, augmented by clustering-based ID consistency losses to exploit unlabeled real data. They introduce SVReID and SVReID+ benchmarks to facilitate synthetic-to-real evaluation and demonstrate state-of-the-art performance across five real datasets, with synthetic data sometimes outperforming real data in cross-domain transfers. The study highlights the practical benefits of synthetic video data for scalable Re-ID and provides a benchmark for future research in cross-domain video person Re-ID.
Abstract
Person re-identification (Re-ID) is an important task and has significant applications for public security and information forensics, which has progressed rapidly with the development of deep learning. In this work, we investigate a novel and challenging setting of Re-ID, i.e., cross-domain video-based person Re-ID. Specifically, we utilize synthetic video datasets as the source domain for training and real-world videos for testing, notably reducing the reliance on expensive real data acquisition and annotation. To harness the potential of synthetic data, we first propose a self-supervised domain-invariant feature learning strategy for both static and dynamic (temporal) features. Additionally, to enhance person identification accuracy in the target domain, we propose a mean-teacher scheme incorporating a self-supervised ID consistency loss. Experimental results across five real datasets validate the rationale behind cross-synthetic-real domain adaptation and demonstrate the efficacy of our method. Notably, the discovery that synthetic data outperforms real data in the cross-domain scenario is a surprising outcome. The code and data are publicly available at https://github.com/XiangqunZhang/UDA_Video_ReID.
