Decomposing Global Feature Effects Based on Feature Interactions
Julia Herbinger, Marvin N. Wright, Thomas Nagler, Bernd Bischl, Giuseppe Casalicchio
TL;DR
The paper addresses aggregation bias in global feature effect plots when interactions are present by introducing GADGET, a framework that partitions the feature space into regions with minimized interaction-related heterogeneity. It establishes a local-decomposability axiom and shows that PD, ALE, and SD satisfy it, enabling regional explanations (GADGET-PD, GADGET-ALE, GADGET-SD) with additive regional main effects. A new permutation-based PINT procedure detects global feature interactions in a model-agnostic way, guiding the selection of interacting feature subsets. Empirical studies across simulations and real-world datasets (COMPAS, bikesharing) demonstrate GADGET’s ability to reduce interaction-related heterogeneity (R^2 Tot close to 0.9–0.99 in examples) and produce more faithful regional explanations, with high-dimensional extensions via filtering. The work highlights practical implications for model auditing, bias detection, and interpretable ML, while noting limitations such as higher-order interaction detection in SD without recalculation and the Rashomon effect across models.
Abstract
Global feature effect methods, such as partial dependence plots, provide an intelligible visualization of the expected marginal feature effect. However, such global feature effect methods can be misleading, as they do not represent local feature effects of single observations well when feature interactions are present. We formally introduce generalized additive decomposition of global effects (GADGET), which is a new framework based on recursive partitioning to find interpretable regions in the feature space such that the interaction-related heterogeneity of local feature effects is minimized. We provide a mathematical foundation of the framework and show that it is applicable to the most popular methods to visualize marginal feature effects, namely partial dependence, accumulated local effects, and Shapley additive explanations (SHAP) dependence. Furthermore, we introduce and validate a new permutation-based interaction detection procedure that is applicable to any feature effect method that fits into our proposed framework. We empirically evaluate the theoretical characteristics of the proposed methods based on various feature effect methods in different experimental settings. Moreover, we apply our introduced methodology to three real-world examples to showcase their usefulness.
