Table of Contents
Fetching ...

On the Tractability of SHAP Explanations under Markovian Distributions

Reda Marzouk, Colin de La Higuera

TL;DR

This work addresses the computational intractability of SHAP explanations by relaxing feature independence to Markovian distributions and proving tractability for sequential models. It develops a constructive, language-based framework using Weighted Automata and Weighted Transducers to compute SHAP scores in polynomial time, first for WA under MARKOV and then via reductions for disjoint DNFs and decision trees. The key insight is to decompose SHAP into SHAP1 and SHAP2, express them through language operators, and build polynomial-time WA/WT constructions (aided by Bayes’ rule) that realize the required probabilities. Collectively, the results extend SHAP tractability beyond independence, offering scalable algorithms and laying groundwork for generalized Markovian explanations in XAI with potential extensions to higher-order models.

Abstract

Thanks to its solid theoretical foundation, the SHAP framework is arguably one the most widely utilized frameworks for local explainability of ML models. Despite its popularity, its exact computation is known to be very challenging, proven to be NP-Hard in various configurations. Recent works have unveiled positive complexity results regarding the computation of the SHAP score for specific model families, encompassing decision trees, random forests, and some classes of boolean circuits. Yet, all these positive results hinge on the assumption of feature independence, often simplistic in real-world scenarios. In this article, we investigate the computational complexity of the SHAP score by relaxing this assumption and introducing a Markovian perspective. We show that, under the Markovian assumption, computing the SHAP score for the class of Weighted automata, Disjoint DNFs and Decision Trees can be performed in polynomial time, offering a first positive complexity result for the problem of SHAP score computation that transcends the limitations of the feature independence assumption.

On the Tractability of SHAP Explanations under Markovian Distributions

TL;DR

This work addresses the computational intractability of SHAP explanations by relaxing feature independence to Markovian distributions and proving tractability for sequential models. It develops a constructive, language-based framework using Weighted Automata and Weighted Transducers to compute SHAP scores in polynomial time, first for WA under MARKOV and then via reductions for disjoint DNFs and decision trees. The key insight is to decompose SHAP into SHAP1 and SHAP2, express them through language operators, and build polynomial-time WA/WT constructions (aided by Bayes’ rule) that realize the required probabilities. Collectively, the results extend SHAP tractability beyond independence, offering scalable algorithms and laying groundwork for generalized Markovian explanations in XAI with potential extensions to higher-order models.

Abstract

Thanks to its solid theoretical foundation, the SHAP framework is arguably one the most widely utilized frameworks for local explainability of ML models. Despite its popularity, its exact computation is known to be very challenging, proven to be NP-Hard in various configurations. Recent works have unveiled positive complexity results regarding the computation of the SHAP score for specific model families, encompassing decision trees, random forests, and some classes of boolean circuits. Yet, all these positive results hinge on the assumption of feature independence, often simplistic in real-world scenarios. In this article, we investigate the computational complexity of the SHAP score by relaxing this assumption and introducing a Markovian perspective. We show that, under the Markovian assumption, computing the SHAP score for the class of Weighted automata, Disjoint DNFs and Decision Trees can be performed in polynomial time, offering a first positive complexity result for the problem of SHAP score computation that transcends the limitations of the feature independence assumption.
Paper Structure (23 sections, 15 theorems, 56 equations, 1 figure)