Table of Contents
Fetching ...

Probabilistic Explanations for Linear Models

Bernardo Subercaseaux, Marcelo Arenas, Kuldeep S Meel

TL;DR

The notion of $(delta, \epsilon)$-SR, a simple relaxation of $\delta$-SRs, is proposed, and it is shown that this kind of explanation can be computed efficiently over linear models.

Abstract

Formal XAI is an emerging field that focuses on providing explanations with mathematical guarantees for the decisions made by machine learning models. A significant amount of work in this area is centered on the computation of "sufficient reasons". Given a model $M$ and an input instance $\vec{x}$, a sufficient reason for the decision $M(\vec{x})$ is a subset $S$ of the features of $\vec{x}$ such that for any instance $\vec{z}$ that has the same values as $\vec{x}$ for every feature in $S$, it holds that $M(\vec{x}) = M(\vec{z})$. Intuitively, this means that the features in $S$ are sufficient to fully justify the classification of $\vec{x}$ by $M$. For sufficient reasons to be useful in practice, they should be as small as possible, and a natural way to reduce the size of sufficient reasons is to consider a probabilistic relaxation; the probability of $M(\vec{x}) = M(\vec{z})$ must be at least some value $δ\in (0,1]$, for a random instance $\vec{z}$ that coincides with $\vec{x}$ on the features in $S$. Computing small $δ$-sufficient reasons ($δ$-SRs) is known to be a theoretically hard problem; even over decision trees--traditionally deemed simple and interpretable models--strong inapproximability results make the efficient computation of small $δ$-SRs unlikely. We propose the notion of $(δ, ε)$-SR, a simple relaxation of $δ$-SRs, and show that this kind of explanation can be computed efficiently over linear models.

Probabilistic Explanations for Linear Models

TL;DR

The notion of -SR, a simple relaxation of -SRs, is proposed, and it is shown that this kind of explanation can be computed efficiently over linear models.

Abstract

Formal XAI is an emerging field that focuses on providing explanations with mathematical guarantees for the decisions made by machine learning models. A significant amount of work in this area is centered on the computation of "sufficient reasons". Given a model and an input instance , a sufficient reason for the decision is a subset of the features of such that for any instance that has the same values as for every feature in , it holds that . Intuitively, this means that the features in are sufficient to fully justify the classification of by . For sufficient reasons to be useful in practice, they should be as small as possible, and a natural way to reduce the size of sufficient reasons is to consider a probabilistic relaxation; the probability of must be at least some value , for a random instance that coincides with on the features in . Computing small -sufficient reasons (-SRs) is known to be a theoretically hard problem; even over decision trees--traditionally deemed simple and interpretable models--strong inapproximability results make the efficient computation of small -SRs unlikely. We propose the notion of -SR, a simple relaxation of -SRs, and show that this kind of explanation can be computed efficiently over linear models.
Paper Structure (14 sections, 16 theorems, 97 equations, 2 tables, 2 algorithms)

This paper contains 14 sections, 16 theorems, 97 equations, 2 tables, 2 algorithms.

Key Result

Proposition 1

Given a linear model $\mathcal{L}$, an instance ${\bm{x}}$, and $\delta \in [0,1]$, the size of the smallest $\delta$-SR for $(\mathcal{L}, {\bm{x}})$ cannot be computed in polynomial time unless $\mathrm{FP} = \mathrm{\#P}$.

Theorems & Definitions (34)

  • Example 1
  • Definition 1: Sufficient Reason Darwiche_Hirth_2020
  • Definition 2: Waldchen_MacDonald_Hauch_Kutyniok_2021
  • Definition 3
  • Example 2
  • Proposition 1
  • Definition 4: $(\delta, \varepsilon)$-min-SR
  • Theorem 1: Kozachinskiy_2023, Theorem 1
  • Theorem 2
  • Definition 5
  • ...and 24 more