Beyond Additivity: Sparse Isotonic Shapley Regression toward Nonlinear Explainability

Jialai She

Beyond Additivity: Sparse Isotonic Shapley Regression toward Nonlinear Explainability

Jialai She

TL;DR

This work tackles the limitations of standard Shapley attributions when payoffs are non-additive due to non-Gaussianity, feature dependence, or domain-specific losses, and when high-dimensional sparsity is required. It introduces Sparse Isotonic Shapley Regression (SISR), which learns a data-driven monotone transformation to restore additivity in the transformed domain while enforcing an $\ell_0$ sparsity constraint; the optimization combines Pool-Adjacent-Violators isotonic regression with normalized hard-thresholding and comes with global convergence guarantees. The authors prove that the transformation can be accurately recovered in many settings and demonstrate through extensive experiments on regression, logistic regression, and tree ensembles that SISR stabilizes attributions across payoff schemes, correctly filters irrelevant features, and avoids rank and sign distortions seen with conventional Shapley methods. They also discuss extensions to nonlinear Shapley GLMs and emphasize that nonlinear payoff distortions can arise from feature Irrelevance and dependencies, making nonlinear explainability essential for robust interpretation in practice.

Abstract

Shapley values, a gold standard for feature attribution in Explainable AI, face two primary challenges. First, the canonical Shapley framework assumes that the worth function is additive, yet real-world payoff constructions--driven by non-Gaussian distributions, heavy tails, feature dependence, or domain-specific loss scales--often violate this assumption, leading to distorted attributions. Secondly, achieving sparse explanations in high dimensions by computing dense Shapley values and then applying ad hoc thresholding is prohibitively costly and risks inconsistency. We introduce Sparse Isotonic Shapley Regression (SISR), a unified nonlinear explanation framework. SISR simultaneously learns a monotonic transformation to restore additivity--obviating the need for a closed-form specification--and enforces an L0 sparsity constraint on the Shapley vector, enhancing computational efficiency in large feature spaces. Its optimization algorithm leverages Pool-Adjacent-Violators for efficient isotonic regression and normalized hard-thresholding for support selection, yielding implementation ease and global convergence guarantees. Analysis shows that SISR recovers the true transformation in a wide range of scenarios and achieves strong support recovery even in high noise. Moreover, we are the first to demonstrate that irrelevant features and inter-feature dependencies can induce a true payoff transformation that deviates substantially from linearity. Experiments in regression, logistic regression, and tree ensembles demonstrate that SISR stabilizes attributions across payoff schemes, correctly filters irrelevant features while standard Shapley values suffer severe rank and sign distortions. By unifying nonlinear transformation estimation with sparsity pursuit, SISR advances the frontier of nonlinear explainability, providing a theoretically grounded and practical attribution framework.

Beyond Additivity: Sparse Isotonic Shapley Regression toward Nonlinear Explainability

TL;DR

Abstract

Beyond Additivity: Sparse Isotonic Shapley Regression toward Nonlinear Explainability

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (5)