Tracking Skiers from the Top to the Bottom
Matteo Dunnhofer, Luca Sordi, Niki Martinel, Christian Micheloni
TL;DR
This paper tackles the challenge of tracking skiers across an entire performance in monocular multi-camera broadcasts, addressing a gap in skiing-focused benchmarks. It introduces SkiTB, a large, densely annotated dataset designed for comprehensive per-frame skier localization and multi-camera analysis, along with a skier-optimized baseline tracker (STARK_SKI) and a fine-tuned STARK (STARK_FT). Through extensive experiments comparing generic trackers and skier-specific approaches, the study shows that domain-specific trackers yield substantial gains in localization accuracy and improve downstream 2D pose estimation (SkiPosePTZ), under diverse conditions and splits (new performances, unseen athletes, unseen courses). The results highlight the practical potential of vision-based skiing analytics for performance understanding and broadcasting, while also identifying remaining challenges such as cross-camera continuity, occlusions, small appearance, and efficiency. Overall, SkiTB enables robust evaluation of tracking methods in skiing and guides future work toward better generalization and real-time applicability in high-level analysis pipelines.
Abstract
Skiing is a popular winter sport discipline with a long history of competitive events. In this domain, computer vision has the potential to enhance the understanding of athletes' performance, but its application lags behind other sports due to limited studies and datasets. This paper makes a step forward in filling such gaps. A thorough investigation is performed on the task of skier tracking in a video capturing his/her complete performance. Obtaining continuous and accurate skier localization is preemptive for further higher-level performance analyses. To enable the study, the largest and most annotated dataset for computer vision in skiing, SkiTB, is introduced. Several visual object tracking algorithms, including both established methodologies and a newly introduced skier-optimized baseline algorithm, are tested using the dataset. The results provide valuable insights into the applicability of different tracking methods for vision-based skiing analysis. SkiTB, code, and results are available at https://machinelearning.uniud.it/datasets/skitb.
