Table of Contents
Fetching ...

TacticExpert: Spatial-Temporal Graph Language Model for Basketball Tactics

Xu Lingrui, Liu Mandi, Zhang Lei

TL;DR

TacticExpert addresses the challenge of modeling fine-grained spatial-temporal basketball tactics by combining a spatial-temporal propagation symmetry-aware graph transformer with a graph-grounded large language model. The approach introduces a Mixture of Tactic Experts routing mechanism, delay-aware ST attention, and four D2 symmetry views to capture long-range interactions, while aligning graph embeddings with LLM space through CLIP-based semantic alignment and in-context prompting. A two-stage pre-training strategy and dense training with sparse inference yield improved efficiency and robustness, including zero-shot and few-shot generalization to unseen teams or players. Experiments on a basketball-focused dataset demonstrate competitive downstream performance, improved interpretability, and strong inductive biases for tactical reasoning. This work provides a vertically integrated framework that unifies pretraining across datasets and tasks, enabling robust open-ended reasoning in basketball tactics while offering interpretable insights through expert specialization and visualizations.

Abstract

The core challenge in basketball tactic modeling lies in efficiently extracting complex spatial-temporal dependencies from historical data and accurately predicting various in-game events. Existing state-of-the-art (SOTA) models, primarily based on graph neural networks (GNNs), encounter difficulties in capturing long-term, long-distance, and fine-grained interactions among heterogeneous player nodes, as well as in recognizing interaction patterns. Additionally, they exhibit limited generalization to untrained downstream tasks and zero-shot scenarios. In this work, we propose a Spatial-Temporal Propagation Symmetry-Aware Graph Transformer for fine-grained game modeling. This architecture explicitly captures delay effects in the spatial space to enhance player node representations across discrete-time slices, employing symmetry-invariant priors to guide the attention mechanism. We also introduce an efficient contrastive learning strategy to train a Mixture of Tactics Experts module, facilitating differentiated modeling of offensive tactics. By integrating dense training with sparse inference, we achieve a 2.4x improvement in model efficiency. Moreover, the incorporation of Lightweight Graph Grounding for Large Language Models enables robust performance in open-ended downstream tasks and zero-shot scenarios, including novel teams or players. The proposed model, TacticExpert, delineates a vertically integrated large model framework for basketball, unifying pretraining across multiple datasets and downstream prediction tasks. Fine-grained modeling modules significantly enhance spatial-temporal representations, and visualization analyzes confirm the strong interpretability of the model.

TacticExpert: Spatial-Temporal Graph Language Model for Basketball Tactics

TL;DR

TacticExpert addresses the challenge of modeling fine-grained spatial-temporal basketball tactics by combining a spatial-temporal propagation symmetry-aware graph transformer with a graph-grounded large language model. The approach introduces a Mixture of Tactic Experts routing mechanism, delay-aware ST attention, and four D2 symmetry views to capture long-range interactions, while aligning graph embeddings with LLM space through CLIP-based semantic alignment and in-context prompting. A two-stage pre-training strategy and dense training with sparse inference yield improved efficiency and robustness, including zero-shot and few-shot generalization to unseen teams or players. Experiments on a basketball-focused dataset demonstrate competitive downstream performance, improved interpretability, and strong inductive biases for tactical reasoning. This work provides a vertically integrated framework that unifies pretraining across datasets and tasks, enabling robust open-ended reasoning in basketball tactics while offering interpretable insights through expert specialization and visualizations.

Abstract

The core challenge in basketball tactic modeling lies in efficiently extracting complex spatial-temporal dependencies from historical data and accurately predicting various in-game events. Existing state-of-the-art (SOTA) models, primarily based on graph neural networks (GNNs), encounter difficulties in capturing long-term, long-distance, and fine-grained interactions among heterogeneous player nodes, as well as in recognizing interaction patterns. Additionally, they exhibit limited generalization to untrained downstream tasks and zero-shot scenarios. In this work, we propose a Spatial-Temporal Propagation Symmetry-Aware Graph Transformer for fine-grained game modeling. This architecture explicitly captures delay effects in the spatial space to enhance player node representations across discrete-time slices, employing symmetry-invariant priors to guide the attention mechanism. We also introduce an efficient contrastive learning strategy to train a Mixture of Tactics Experts module, facilitating differentiated modeling of offensive tactics. By integrating dense training with sparse inference, we achieve a 2.4x improvement in model efficiency. Moreover, the incorporation of Lightweight Graph Grounding for Large Language Models enables robust performance in open-ended downstream tasks and zero-shot scenarios, including novel teams or players. The proposed model, TacticExpert, delineates a vertically integrated large model framework for basketball, unifying pretraining across multiple datasets and downstream prediction tasks. Fine-grained modeling modules significantly enhance spatial-temporal representations, and visualization analyzes confirm the strong interpretability of the model.

Paper Structure

This paper contains 25 sections, 22 equations, 6 figures, 4 tables.

Figures (6)

  • Figure 1: The findings in basketball scenarios. (a) Player nodes distribution (blue and orange represent the two teams); (b) Propagation delay effect of neighbor nodes (using A and H as examples); (c) Spatial dynamic dependency of player nodes (using A and F as examples); (d) Long-range spatial dependency of player nodes (using A and D as examples).
  • Figure 2: The overall architecture of TacticExpert.
  • Figure 3: Performance, Efficiency, and Hyperparameter Selection of TacticExpert. (a) Node classification task; (b) Link prediction task; (c) Graph classification task
  • Figure 4: Ablation study on full-shot performance
  • Figure 5: t-SNE visualization of tactic experts
  • ...and 1 more figures