Table of Contents
Fetching ...

$φ$-test: Global Feature Selection and Inference for Shapley Additive Explanations

Dongseok Kim, Hyoungsun Choi, Mohamed Jismy Aashik Rasool, Gisung Oh

TL;DR

phi-test addresses the need for statistically principled global feature importance for black-box predictors by marrying SHAP-based screening with a linear surrogate and selective-inference-based post-selection uncertainty. The method yields a global feature-importance table that includes SHAP scores, surrogate coefficients, and selection-adjusted p-values and confidence intervals. Across real tabular datasets with tree-based and neural backbones, phi-test maintains predictive fidelity with a small, stable feature subset and demonstrates robustness across backbones. This work provides a practical bridge between SHAP explanations and classical statistical inference, enabling interpretable, uncertainty-aware global explanations.

Abstract

We propose $φ$-test, a global feature-selection and significance procedure for black-box predictors that combines Shapley attributions with selective inference. Given a trained model and an evaluation dataset, $φ$-test performs SHAP-guided screening and fits a linear surrogate on the screened features via a selection rule with a tractable selective-inference form. For each retained feature, it outputs a Shapley-based global score, a surrogate coefficient, and post-selection $p$-values and confidence intervals in a global feature-importance table. Experiments on real tabular regression tasks with tree-based and neural backbones suggest that $φ$-test can retain much of the predictive ability of the original model while using only a few features and producing feature sets that remain fairly stable across resamples and backbone classes. In these settings, $φ$-test acts as a practical global explanation layer linking Shapley-based importance summaries with classical statistical inference.

$φ$-test: Global Feature Selection and Inference for Shapley Additive Explanations

TL;DR

phi-test addresses the need for statistically principled global feature importance for black-box predictors by marrying SHAP-based screening with a linear surrogate and selective-inference-based post-selection uncertainty. The method yields a global feature-importance table that includes SHAP scores, surrogate coefficients, and selection-adjusted p-values and confidence intervals. Across real tabular datasets with tree-based and neural backbones, phi-test maintains predictive fidelity with a small, stable feature subset and demonstrates robustness across backbones. This work provides a practical bridge between SHAP explanations and classical statistical inference, enabling interpretable, uncertainty-aware global explanations.

Abstract

We propose -test, a global feature-selection and significance procedure for black-box predictors that combines Shapley attributions with selective inference. Given a trained model and an evaluation dataset, -test performs SHAP-guided screening and fits a linear surrogate on the screened features via a selection rule with a tractable selective-inference form. For each retained feature, it outputs a Shapley-based global score, a surrogate coefficient, and post-selection -values and confidence intervals in a global feature-importance table. Experiments on real tabular regression tasks with tree-based and neural backbones suggest that -test can retain much of the predictive ability of the original model while using only a few features and producing feature sets that remain fairly stable across resamples and backbone classes. In these settings, -test acts as a practical global explanation layer linking Shapley-based importance summaries with classical statistical inference.

Paper Structure

This paper contains 48 sections, 3 theorems, 66 equations, 7 tables, 1 algorithm.

Key Result

Lemma 4.1

Let $S_0 = \mathcal{T}_M\bigl((I_j)_{j=1}^p\bigr)$ denote the candidate feature set produced by Stage 1 (Subsection subsec:stage-1) of the $\phi$-test based on global SHAP scores. Under eq:working-model, the set $S_0$ is non-random with respect to $y^{(\mathrm{sur})}$, so all probabilities can be in

Theorems & Definitions (7)

  • Lemma 4.1: Ancillarity of SHAP screening
  • Proposition 4.2: Selective validity
  • proof : Proof sketch
  • Remark 4.3: Scope and interpretation
  • Remark 4.4: Split-sample $\phi$-test
  • Lemma 2.1: Truncated-normal law for $T_j$
  • proof