PitcherNet: Powering the Moneyball Evolution in Baseball Video Analytics
Jerrin Bright, Bavesh Balaji, Yuhao Chen, David A Clausi, John S Zelek
TL;DR
PitcherNet tackles the challenge of extracting pitcher kinematics and pitch statistics from live broadcast video by integrating robust 3D human modeling with a kinematic-driven analysis pipeline. It decouples action from tracklets for reliable pitcher identification, uses D2A-HMR 2.0 with a Depth Anything encoder to estimate 3D pose, and derives statistics such as pitch position, release point, velocity, and release extension from the kinematic data. The approach achieves state-of-the-art performance on MLBPitchDB, including high pitcher-tracklet identification accuracy and strong 3D pose metrics, aided by depth-based improvements and pseudo-ground-truth data. This work enables real-time, data-driven baseball analytics for coaching, strategy, injury prevention, and deeper biomechanical understanding of pitching mechanics in live-game settings.
Abstract
In the high-stakes world of baseball, every nuance of a pitcher's mechanics holds the key to maximizing performance and minimizing runs. Traditional analysis methods often rely on pre-recorded offline numerical data, hindering their application in the dynamic environment of live games. Broadcast video analysis, while seemingly ideal, faces significant challenges due to factors like motion blur and low resolution. To address these challenges, we introduce PitcherNet, an end-to-end automated system that analyzes pitcher kinematics directly from live broadcast video, thereby extracting valuable pitch statistics including velocity, release point, pitch position, and release extension. This system leverages three key components: (1) Player tracking and identification by decoupling actions from player kinematics; (2) Distribution and depth-aware 3D human modeling; and (3) Kinematic-driven pitch statistics. Experimental validation demonstrates that PitcherNet achieves robust analysis results with 96.82% accuracy in pitcher tracklet identification, reduced joint position error by 1.8mm and superior analytics compared to baseline methods. By enabling performance-critical kinematic analysis from broadcast video, PitcherNet paves the way for the future of baseball analytics by optimizing pitching strategies, preventing injuries, and unlocking a deeper understanding of pitcher mechanics, forever transforming the game.
