Augmenting Sports Videos with VisCommentator
Chen Zhu-Tian, Shuainan Ye, Xiangtong Chu, Haijun Xia, Hui Zhang, Huamin Qu, Yingcai Wu
TL;DR
This paper tackles the challenge of embedding data visualizations in sports videos by deriving a design space that separates what data/visuals are used from how they are organized. It analyzes 233 real-world augmented videos to define element-level (Data Type, Visual Type) and clip-level (Data Level, Narrative Order) dimensions, and demonstrates patterns that guide visualization design. Building on this space, VisCommentator provides a data-driven, rapid prototyping tool for table tennis videos, automatically extracting data with ML models, enabling direct interaction with video objects, and recommending visuals via a probabilistic mapping that accounts for narrative order, implemented in a browser/server architecture. A reproduction-based user study with seven domain experts shows high usability and efficiency, indicating strong potential for generalization to other racket sports and for guiding future integration with visual analytics and broader sports contexts.
Abstract
Visualizing data in sports videos is gaining traction in sports analytics, given its ability to communicate insights and explicate player strategies engagingly. However, augmenting sports videos with such data visualizations is challenging, especially for sports analysts, as it requires considerable expertise in video editing. To ease the creation process, we present a design space that characterizes augmented sports videos at an element-level (what the constituents are) and clip-level (how those constituents are organized). We do so by systematically reviewing 233 examples of augmented sports videos collected from TV channels, teams, and leagues. The design space guides selection of data insights and visualizations for various purposes. Informed by the design space and close collaboration with domain experts, we design VisCommentator, a fast prototyping tool, to eases the creation of augmented table tennis videos by leveraging machine learning-based data extractors and design space-based visualization recommendations. With VisCommentator, sports analysts can create an augmented video by selecting the data to visualize instead of manually drawing the graphical marks. Our system can be generalized to other racket sports (e.g., tennis, badminton) once the underlying datasets and models are available. A user study with seven domain experts shows high satisfaction with our system, confirms that the participants can reproduce augmented sports videos in a short period, and provides insightful implications into future improvements and opportunities.
