Consistent-Point: Consistent Pseudo-Points for Semi-Supervised Crowd Counting and Localization
Yuda Zou, Zelong Liu, Yuliang Gu, Bo Du, Yongchao Xu
TL;DR
Consistent-Point addresses two forms of pseudo-point inconsistency in semi-supervised, point-localization crowd counting and localization by introducing Position Aggregation (PA) and Instance-wise Uncertainty Calibration (IUC) within a mean-teacher framework built on P2PNet. PA smooths pseudo-point positions by averaging neighboring auxiliary proposal-points, while IUC weights pseudo-points by their classification confidence to stabilize their semantic meaning. Empirically, the approach delivers state-of-the-art localization across five datasets and multiple labeling ratios and also achieves competitive counting results, including surpassing some fully supervised baselines under label scarcity. The work demonstrates that enforcing pseudo-point consistency substantially improves learning signals in semi-supervised crowd counting and localization, with practical benefits for annotation-efficient deployment.
Abstract
Crowd counting and localization are important in applications such as public security and traffic management. Existing methods have achieved impressive results thanks to extensive laborious annotations. This paper propose a novel point-localization-based semi-supervised crowd counting and localization method termed Consistent-Point. We identify and address two inconsistencies of pseudo-points, which have not been adequately explored. To enhance their position consistency, we aggregate the positions of neighboring auxiliary proposal-points. Additionally, an instance-wise uncertainty calibration is proposed to improve the class consistency of pseudo-points. By generating more consistent pseudo-points, Consistent-Point provides more stable supervision to the training process, yielding improved results. Extensive experiments across five widely used datasets and three different labeled ratio settings demonstrate that our method achieves state-of-the-art performance in crowd localization while also attaining impressive crowd counting results. The code will be available.
