Prediction-based evaluation of back-four defense with spatial control in soccer
Soujanya Dash, Kenjiro Ide, Rikuhei Umemoto, Kai Amino, Keisuke Fujii
TL;DR
Addressing the challenge of quantifying collective back-four defense during negative transitions in elite soccer, the study introduces interpretable spatio-temporal indicators (Space Score, Stretch Index, Pressure Index, and Defensive Line Height Absolute/Relative) derived from synchronized tracking and event data. Analyzing 2,413 defensive sequences from 73 LaLiga matches (Barcelona and Real Madrid) with two-way ANOVA and team-specific predictive models (XGBoost, Random Forest, SVC) reveals that Defensive Line Height relative to the ball is the strongest predictor of defensive success, with Space Score also playing a crucial role. Barcelona displays stronger, more consistent spatial control and line coordination, while Real Madrid shows more adaptive but less stable defensive structures. The work demonstrates that combining interpretable spatial metrics with inferential and predictive analyses provides actionable insights for coaching and real-time tactical analytics in elite soccer.
Abstract
Defensive organization is critical in soccer, particularly during negative transitions when teams are most vulnerable. The back-four defensive line plays a decisive role in preventing goal-scoring opportunities, yet its collective coordination remains difficult to quantify. This study introduces interpretable spatio-temporal indicators namely, space control, stretch index, pressure index, and defensive line height (absolute and relative) to evaluate the effectiveness of the back-four during defensive transitions. Using synchronized tracking and event data from the 2023-24 LaLiga season, 2,413 defensive sequences were analyzed following possession losses by FC Barcelona and Real Madrid CF. Two-way ANOVA revealed significant effects of team, outcome, and their interaction for key indicators, with relative line height showing the strongest association with defensive success. Predictive modeling using XGBoost achieved the highest discriminative performance (ROC AUC: 0.724 for Barcelona, 0.698 for Real Madrid), identifying space score and relative line height as dominant predictors. Comparative analysis revealed distinct team-specific defensive behaviors: Barcelona's success was characterized by higher spatial control and compact line coordination, whereas Real Madrid exhibited more adaptive but less consistent defensive structures. These findings demonstrate the tactical and predictive value of interpretable spatial indicators for quantifying collective defensive performance.
