Benchmarking pig detection and tracking under diverse and challenging conditions
Jonathan Henrich, Christian Post, Maximilian Zilke, Parth Shiroya, Emma Chanut, Amir Mollazadeh Yamchi, Ramin Yahyapour, Thomas Kneib, Imke Traulsen
TL;DR
This work delivers two public benchmark datasets, PigDetect for pig detection and PigTrack for pig tracking, gathered from diverse barn environments and annotated with challenging cases to stress modern algorithms. Through systematic benchmarking of state-of-the-art general-purpose detectors and trackers, the study finds that data-centric improvements (e.g., including challenging images) yield notable gains, that SORT-based trackers often outperform end-to-end models on detection while end-to-end models excel at association, and that strong generalization relies on high-quality, diverse training data. Oracle analyses imply detection quality is a key bottleneck for tracking in realistic pens, while third-party transfers demonstrate the value of open data for reproducibility and progress. The datasets and code are released to facilitate ongoing development and fair comparisons in precision livestock farming. Overall, the paper advances reproducibility and provides practical insights into model selection based on application needs and data availability.
Abstract
To ensure animal welfare and effective management in pig farming, monitoring individual behavior is a crucial prerequisite. While monitoring tasks have traditionally been carried out manually, advances in machine learning have made it possible to collect individualized information in an increasingly automated way. Central to these methods is the localization of animals across space (object detection) and time (multi-object tracking). Despite extensive research of these two tasks in pig farming, a systematic benchmarking study has not yet been conducted. In this work, we address this gap by curating two datasets: PigDetect for object detection and PigTrack for multi-object tracking. The datasets are based on diverse image and video material from realistic barn conditions, and include challenging scenarios such as occlusions or bad visibility. For object detection, we show that challenging training images improve detection performance beyond what is achievable with randomly sampled images alone. Comparing different approaches, we found that state-of-the-art models offer substantial improvements in detection quality over real-time alternatives. For multi-object tracking, we observed that SORT-based methods achieve superior detection performance compared to end-to-end trainable models. However, end-to-end models show better association performance, suggesting they could become strong alternatives in the future. We also investigate characteristic failure cases of end-to-end models, providing guidance for future improvements. The detection and tracking models trained on our datasets perform well in unseen pens, suggesting good generalization capabilities. This highlights the importance of high-quality training data. The datasets and research code are made publicly available to facilitate reproducibility, re-use and further development.
