Annotation Techniques for Judo Combat Phase Classification from Tournament Footage
Anthony Miyaguchi, Jed Moutahir, Tanmay Sutar
TL;DR
This work tackles automated annotation and combat-phase classification in fixed-angle judo tournament footage under limited labeled data. It introduces a semi-supervised pipeline that transfers knowledge from a fine-tuned object detector to classify match presence, activity, and standing states, using a combination of frame-level labeling, OCR timer cues, and embedding-based classifiers with and without temporal context. The approach is validated on a dataset of 19 thirty-second clips, achieving competitive F1 scores and demonstrating the feasibility of automated match segmentation and phase analysis in semi-supervised settings. The study lays groundwork for scalable, automated retrieval of highlights, statistics, and strategic insights from judo broadcasts, with multiple avenues for future multimodal and technique-level extensions.
Abstract
This paper presents a semi-supervised approach to extracting and analyzing combat phases in judo tournaments using live-streamed footage. The objective is to automate the annotation and summarization of live streamed judo matches. We train models that extract relevant entities and classify combat phases from fixed-perspective judo recordings. We employ semi-supervised methods to address limited labeled data in the domain. We build a model of combat phases via transfer learning from a fine-tuned object detector to classify the presence, activity, and standing state of the match. We evaluate our approach on a dataset of 19 thirty-second judo clips, achieving an F1 score on a $20\%$ test hold-out of 0.66, 0.78, and 0.87 for the three classes, respectively. Our results show initial promise for automating more complex information retrieval tasks using rigorous methods with limited labeled data.
