Rethinking Sampling Strategies for Unsupervised Person Re-identification
Xumeng Han, Xuehui Yu, Guorong Li, Jian Zhao, Gang Pan, Qixiang Ye, Jianbin Jiao, Zhenjun Han
TL;DR
This paper identifies sampling strategy as a crucial factor in unsupervised person re-ID, introducing deteriorated over-fitting and statistical stability to explain why random sampling collapses learning while structured sampling can succeed. It proposes group sampling, which groups samples from the same class to emphasize class-wide trends and suppress the influence of individual samples, thereby improving pseudo-label quality and representation learning. Through extensive experiments on Market-1501, DukeMTMC-reID, and MSMT17, group sampling achieves competitive or superior performance to state-of-the-art fully unsupervised methods without additional parameters or computation, particularly under camera-agnostic settings. The findings highlight a practical, low-cost approach to enhance unsupervised re-ID by focusing on sampling design to sustain within-class coherence and inter-class separation during contrastive learning.
Abstract
Unsupervised person re-identification (re-ID) remains a challenging task. While extensive research has focused on the framework design and loss function, this paper shows that sampling strategy plays an equally important role. We analyze the reasons for the performance differences between various sampling strategies under the same framework and loss function. We suggest that deteriorated over-fitting is an important factor causing poor performance, and enhancing statistical stability can rectify this problem. Inspired by that, a simple yet effective approach is proposed, termed group sampling, which gathers samples from the same class into groups. The model is thereby trained using normalized group samples, which helps alleviate the negative impact of individual samples. Group sampling updates the pipeline of pseudo-label generation by guaranteeing that samples are more efficiently classified into the correct classes. It regulates the representation learning process, enhancing statistical stability for feature representation in a progressive fashion. Extensive experiments on Market-1501, DukeMTMC-reID and MSMT17 show that group sampling achieves performance comparable to state-of-the-art methods and outperforms the current techniques under purely camera-agnostic settings. Code has been available at https://github.com/ucas-vg/GroupSampling.
