Human-in-the-Loop Visual Re-ID for Population Size Estimation
Gustavo Perez, Daniel Sheldon, Grant Van Horn, Subhransu Maji
TL;DR
The work tackles estimating population size in large image collections despite imperfect Re-ID by introducing a human-in-the-loop estimator based on nested importance sampling that leverages pairwise similarity to produce unbiased counts $k=|\mathcal{Y}|$ with confidence intervals. It defines proposal distributions derived from approximate similarities and derives an estimator $\widehat{CC}_{N,M}$ that requires only $N\times M$ human queries, with proven asymptotic normality and bias $O(1/M)$. The approach outperforms strong baselines across seven animal datasets, delivering accurate estimates with extremely small human effort (often $<0.002\%$ of all pairs) and calibrated CIs, making it practical for wildlife monitoring and generalized category discovery. The method is deployment-ready on top of any Re-ID system and provides a principled way to quantify uncertainty and guide human labeling.
Abstract
Computer vision-based re-identification (Re-ID) systems are increasingly being deployed for estimating population size in large image collections. However, the estimated size can be significantly inaccurate when the task is challenging or when deployed on data from new distributions. We propose a human-in-the-loop approach for estimating population size driven by a pairwise similarity derived from an off-the-shelf Re-ID system. Our approach, based on nested importance sampling, selects pairs of images for human vetting driven by the pairwise similarity, and produces asymptotically unbiased population size estimates with associated confidence intervals. We perform experiments on various animal Re-ID datasets and demonstrate that our method outperforms strong baselines and active clustering approaches. In many cases, we are able to reduce the error rates of the estimated size from around 80% using CV alone to less than 20% by vetting a fraction (often less than 0.002%) of the total pairs. The cost of vetting reduces with the increase in accuracy and provides a practical approach for population size estimation within a desired tolerance when deploying Re-ID systems.
