Integrating Unsupervised and Supervised Learning for the Prediction of Defensive Schemes in American football

Rouven Michels; Robert Bajons; Jan-Ole Fischer

Integrating Unsupervised and Supervised Learning for the Prediction of Defensive Schemes in American football

Rouven Michels, Robert Bajons, Jan-Ole Fischer

TL;DR

This work tackles the challenge of predicting NFL defensive schemes (man vs zone) from pre-snap motion and tracking data by coupling an unsupervised non-homogeneous hidden Markov model with supervised learners. The HMM infers latent defender–offense guard assignments across motion, producing features such as switch counts and entropy that feed elastic net and XGBoost classifiers, yielding improved prediction accuracy and significant associations with coverage outcomes via the Generalized Covariance Measure. Key contributions include the integration of random effects in the HMM, a data-driven lag selection for defender reaction time, and a rigorous out-of-sample evaluation demonstrating the practical value of latent-state features for understanding defensive behavior and motion-based offense advantages. The framework is modular, enabling extensions to finer-grained coverages and potential neural-network integrations, with implications for broader analytics in team sports.

Abstract

Anticipating defensive coverage schemes is a crucial yet challenging task for offenses in American football. Because defenders' assignments are intentionally disguised before the snap, they remain difficult to recognize in real time. To address this challenge, we develop a statistical framework that integrates supervised and unsupervised learning using player tracking data. Our goal is to forecast the defensive coverage scheme -- man or zone -- through elastic net logistic regression and gradient-boosted decision trees with incrementally derived features. We first use features from the pre-motion situation, then incorporate players' trajectories during motion in a naive way, and finally include features derived from a hidden Markov model (HMM). Based on player movements, the non-homogeneous HMM infers latent defensive assignments between offensive and defensive players during motion and transforms decoded state sequences into informative features for the supervised models. These HMM-based features enhance predictive performance and are significantly associated with coverage outcomes. Moreover, estimated random effects offer interpretable insights into how different defenses and positions adjust their coverage responsibilities.

Integrating Unsupervised and Supervised Learning for the Prediction of Defensive Schemes in American football

TL;DR

Abstract

Paper Structure (20 sections, 17 equations, 11 figures, 3 tables)

This paper contains 20 sections, 17 equations, 11 figures, 3 tables.

Introduction
Data
Methods
Unsupervised learning
HMM formulation
Modeling the underlying Markov chain
Model fitting
Feature extraction from state decoding
Supervised learning
Model evaluation and inference
Results
Unsupervised learning
Supervised learning
Team analysis
Discussion
...and 5 more sections

Figures (11)

Figure 1: This figure illustrates the step-by-step pre-processing for a game between the Arizona Cardinals and the Kansas City Chiefs. From top left to bottom right, we first display all players, then exclude offensive non-skill players, next omit defensive linemen, and finally remove pass-rushing linebackers and deep safeties. Grey circles mark the players excluded during each pre-processing step.
Figure 1: This figure displays the result of a preliminary analysis of a homogeneous HMM. On the x-axis, we displayed the lag $l$ (see Eq. \ref{['eq:lag']}) and on the y-axis the corresponding AIC value.
Figure 2: This figure illustrates the HMM modeling approach. Specifically, we project the $(x,y)$-coordinates of the players onto a single $\mathrm{y}$-axis. We assume that the observations $y_t$, i.e., the defensive players' $\mathrm{y}$-coordinate, is generated by one of the five Gaussian state-dependent distributions, whose mean equals the $\mathrm{y}$-coordinate of the five offensive players.
Figure 2: Predicted random effect for each defense. Smaller values correspond to a higher probability of staying in the respective state, i.e., continuing to covering a specific defender, and vice versa.
Figure 3: Dependence structure of the HMM used to model the defenders’ vertical positions $Y_t$, driven by the latent state variable $S_t$ that represents the currently guarded offensive player.
...and 6 more figures

Integrating Unsupervised and Supervised Learning for the Prediction of Defensive Schemes in American football

TL;DR

Abstract

Integrating Unsupervised and Supervised Learning for the Prediction of Defensive Schemes in American football

Authors

TL;DR

Abstract

Table of Contents

Figures (11)