Aumann-SHAP: The Geometry of Counterfactual Interaction Explanations in Machine Learning

Adam Belahcen; Stéphane Mussard

Aumann-SHAP: The Geometry of Counterfactual Interaction Explanations in Machine Learning

Adam Belahcen, Stéphane Mussard

Abstract

We introduce Aumann-SHAP, an interaction-aware framework that decomposes counterfactual transitions by restricting the model to a local hypercube connecting baseline and counterfactual features. Each hyper-cube is decomposed into a grid in order to construct an induced micro-player cooperative game in which elementary grid-step moves become players. Shapley and LES values on this TU-micro-game yield: (i) within-pot contribution of each feature to the interaction with other features (interaction explainability), and (ii) the contribution of each instance and each feature to the counterfactual analysis (individual and global explainability). In particular, Aumann-LES values produce individual and global explanations along the counterfactual transition. Shapley and LES values converge to the diagonal Aumann-Shapley (integrated-gradients) attribution method. Experiments on the German Credit dataset and MNIST data show that Aumann-LES produces robust results and better explanations than the standard Shapley value during the counterfactual transition.

Aumann-SHAP: The Geometry of Counterfactual Interaction Explanations in Machine Learning

Abstract

Paper Structure (23 sections, 5 theorems, 77 equations, 8 figures, 8 tables)

This paper contains 23 sections, 5 theorems, 77 equations, 8 figures, 8 tables.

Introduction
State of the art
Explainability and feature attribution
Explainability and counterfactual analysis
TU-micro-game: decomposition of feature interactions
Setup
Counterfactual analysis on local cube restriction
Discretization of the cube on a grid & TU-micro-game
Aumann-SHAP
Interaction, local and global explainability
Generalization to LES values
Experiments
Simulations: Exact computation and complexity
Convergence.
German credit
...and 8 more sections

Key Result

proposition 1

Let $u\subseteq N$ with $|u|=k\ge 2$. $(\imath)$ If $t_i=0$ for some $i\in u$, then $r_u(t)=0$. (Boundary)$(\imath\imath)$ At the far corner, $r_u(\mathbf{1}_k)=\phi_u$. (Efficiency)

Figures (8)

Figure 1: Local slider cube for $u=\{i,j,k\}$ (left) and the sub-cube from $(0,0,0)$ to $(t_i,t_j,t_k)$ (right), illustrating corner evaluations and the inclusion--exclusion construction.
Figure 2: Solid curves with markers provide runtimes: grid-state computation (blue) and, when feasible, full micro-coalition enumeration (orange). Dashed curves are fitted scaling laws: $\sim c\,k(m{+}1)^k$ for grid-state (green) and $\sim c\,n2^n$ with $n{=}km$ for full enumeration (red).
Figure 3: MNIST $1\to 7$ counterfactual: attribution maps and redistribution. Top row: baseline $x^0$, endpoint $x^1$ (only the $k$ changed pixels are moved), and Top-25 masks for equal-split and TU-micro-game Shapley (red). Bottom row: signed heatmaps, redistribution $D=\text{micro}-\text{equal}$, and an overlay of the largest re-allocations on $x^1$.
Figure 4: Global MNIST $1\to7$ mean explanations (nearest-neighbor pairing). We sample $200$ test-set baselines $x^0$ of digit $1$ and pair each with its nearest neighbor $x^1$ among test-set $7$'s (Euclidean distance), then compute local attributions on the changed-pixel set $\{(r,c):|x^1_{rc}-x^0_{rc}|>\varepsilon\}$ with $\varepsilon=0.05$ and average the resulting maps over pairs.
Figure 5: Counterfactual patch test on MNIST ($1\to7$): $P_7$ after patching the top-$K$ pixels selected by each rule (Micro-game Shapley, Equal-split Shapley, Equal surplus, and $|x^1-x^0|$), compared to a random baseline (mean with 10--90% band). Top row shows $x^0$ and the patched images at $K=12$.
...and 3 more figures

Theorems & Definitions (8)

definition 1: Harsanyi dividend on a local-cube
proposition 1: Properties
definition 2: Grid discretization
definition 3: Micro-players and TU-micro-game
theorem 1: Micro-player Shapley value
proposition 2: Equal-split Shapley luborgonovo2024
theorem 2: Aumann--SHAP
proposition 3: LES closed form

Aumann-SHAP: The Geometry of Counterfactual Interaction Explanations in Machine Learning

Abstract

Aumann-SHAP: The Geometry of Counterfactual Interaction Explanations in Machine Learning

Authors

Abstract

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (8)