Probabilistic Explanations for Linear Models

Bernardo Subercaseaux; Marcelo Arenas; Kuldeep S Meel

Probabilistic Explanations for Linear Models

Bernardo Subercaseaux, Marcelo Arenas, Kuldeep S Meel

TL;DR

The notion of $(delta, \epsilon)$-SR, a simple relaxation of $\delta$-SRs, is proposed, and it is shown that this kind of explanation can be computed efficiently over linear models.

Abstract

Formal XAI is an emerging field that focuses on providing explanations with mathematical guarantees for the decisions made by machine learning models. A significant amount of work in this area is centered on the computation of "sufficient reasons". Given a model $M$ and an input instance $\vec{x}$, a sufficient reason for the decision $M(\vec{x})$ is a subset $S$ of the features of $\vec{x}$ such that for any instance $\vec{z}$ that has the same values as $\vec{x}$ for every feature in $S$, it holds that $M(\vec{x}) = M(\vec{z})$. Intuitively, this means that the features in $S$ are sufficient to fully justify the classification of $\vec{x}$ by $M$. For sufficient reasons to be useful in practice, they should be as small as possible, and a natural way to reduce the size of sufficient reasons is to consider a probabilistic relaxation; the probability of $M(\vec{x}) = M(\vec{z})$ must be at least some value $δ\in (0,1]$, for a random instance $\vec{z}$ that coincides with $\vec{x}$ on the features in $S$. Computing small $δ$-sufficient reasons ($δ$-SRs) is known to be a theoretically hard problem; even over decision trees--traditionally deemed simple and interpretable models--strong inapproximability results make the efficient computation of small $δ$-SRs unlikely. We propose the notion of $(δ, ε)$-SR, a simple relaxation of $δ$-SRs, and show that this kind of explanation can be computed efficiently over linear models.

Probabilistic Explanations for Linear Models

TL;DR

The notion of

-SR, a simple relaxation of

-SRs, is proposed, and it is shown that this kind of explanation can be computed efficiently over linear models.

Abstract

and an input instance

, a sufficient reason for the decision

is a subset

of the features of

such that for any instance

that has the same values as

for every feature in

, it holds that

. Intuitively, this means that the features in

are sufficient to fully justify the classification of

. For sufficient reasons to be useful in practice, they should be as small as possible, and a natural way to reduce the size of sufficient reasons is to consider a probabilistic relaxation; the probability of

must be at least some value

, for a random instance

that coincides with

on the features in

. Computing small

-sufficient reasons (

-SRs) is known to be a theoretically hard problem; even over decision trees--traditionally deemed simple and interpretable models--strong inapproximability results make the efficient computation of small

-SRs unlikely. We propose the notion of

-SR, a simple relaxation of

-SRs, and show that this kind of explanation can be computed efficiently over linear models.

Paper Structure (14 sections, 16 theorems, 97 equations, 2 tables, 2 algorithms)

This paper contains 14 sections, 16 theorems, 97 equations, 2 tables, 2 algorithms.

Introduction
Probabilistic Sufficient Reasons
The size of delta-SRs
Approximating delta-Sufficient Reasons
A Simple Relaxation: (delta, epsilon)-min-SR
Estimating the Probability of Acceptance
Feature Selection
Locally Minimal Probabilistic Explanations
Approximations on the Size of Explanations
Conclusions and Future Work
Calculation for Example 1
Proposition 1
Proposition 2
Lemma 1 and Theorem 4

Key Result

Proposition 1

Given a linear model $\mathcal{L}$, an instance ${\bm{x}}$, and $\delta \in [0,1]$, the size of the smallest $\delta$-SR for $(\mathcal{L}, {\bm{x}})$ cannot be computed in polynomial time unless $\mathrm{FP} = \mathrm{\#P}$.

Theorems & Definitions (34)

Example 1
Definition 1: Sufficient Reason Darwiche_Hirth_2020
Definition 2: Waldchen_MacDonald_Hauch_Kutyniok_2021
Definition 3
Example 2
Proposition 1
Definition 4: $(\delta, \varepsilon)$-min-SR
Theorem 1: Kozachinskiy_2023, Theorem 1
Theorem 2
Definition 5
...and 24 more

Probabilistic Explanations for Linear Models

TL;DR

Abstract

Probabilistic Explanations for Linear Models

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (34)