Dual feature-based and example-based explanation methods

Andrei V. Konstantinov; Boris V. Kozlov; Stanislav R. Kirpichenko; Lev V. Utkin

Dual feature-based and example-based explanation methods

Andrei V. Konstantinov, Boris V. Kozlov, Stanislav R. Kirpichenko, Lev V. Utkin

TL;DR

The paper addresses explainability for black-box models by introducing a dual feature-based and example-based framework built on convex hull duality. It replaces perturbations in the original feature space with uniform sampling on the unit simplex of convex coefficients derived from the hull of $K$ nearest neighbors around the explainable instance, yielding a dual dataset for a linear surrogate $h(\boldsymbol{\lambda})$. The primal explanation coefficients are recovered from the dual via $\mathbf{aX}=\mathbf{b}$, enabling both feature-based and example-based explanations, with NAM applicable on the dual features to obtain interpretable shape functions. Empirical results on synthetic and real data show robustness to out-of-domain perturbations, improved or competitive accuracy relative to LIME, and coherent example-based explanations, while code availability supports reproducibility. The approach offers a flexible, perturbation-free alternative that can be extended to global explanations and other surrogate models, with practical impact for reliable model interpretation in safety-critical domains.

Abstract

A new approach to the local and global explanation is proposed. It is based on selecting a convex hull constructed for the finite number of points around an explained instance. The convex hull allows us to consider a dual representation of instances in the form of convex combinations of extreme points of a produced polytope. Instead of perturbing new instances in the Euclidean feature space, vectors of convex combination coefficients are uniformly generated from the unit simplex, and they form a new dual dataset. A dual linear surrogate model is trained on the dual dataset. The explanation feature importance values are computed by means of simple matrix calculations. The approach can be regarded as a modification of the well-known model LIME. The dual representation inherently allows us to get the example-based explanation. The neural additive model is also considered as a tool for implementing the example-based explanation approach. Many numerical experiments with real datasets are performed for studying the approach. The code of proposed algorithms is available.

Dual feature-based and example-based explanation methods

TL;DR

nearest neighbors around the explainable instance, yielding a dual dataset for a linear surrogate

. The primal explanation coefficients are recovered from the dual via

, enabling both feature-based and example-based explanations, with NAM applicable on the dual features to obtain interpretable shape functions. Empirical results on synthetic and real data show robustness to out-of-domain perturbations, improved or competitive accuracy relative to LIME, and coherent example-based explanations, while code availability supports reproducibility. The approach offers a flexible, perturbation-free alternative that can be extended to global explanations and other surrogate models, with practical impact for reliable model interpretation in safety-critical domains.

Abstract

Paper Structure (17 sections, 29 equations, 13 figures, 4 tables, 1 algorithm)

This paper contains 17 sections, 29 equations, 13 figures, 4 tables, 1 algorithm.

Introduction
Related work
Preliminaries
Convex hull
LIME, SHAP, NAM and example-based methods
Dual explanation
Example-based explanation and NAM
Numerical experiments with the feature-based explanation
Example 1
Example 2
Example 3
Example 4
Numerical experiments with the example-based explanation
Example 1
Example 2
...and 2 more sections

Figures (13)

Figure 1: An illustration of a case of out-of-domain data when generated points may be out of the training point domain
Figure 2: Two cases of the explained point location and the convex polytops constructed from $K$ nearest neighbors
Figure 3: Steps of the algorithm for explanation of a prediction provided by a black-box model at the point depicted by the small triangle
Figure 4: Generated points in the original LIME (a) and in the proposed dual method (b)
Figure 5: A scheme of training NAM on the generated set of random vectors $\mathbf{\lambda}$
...and 8 more figures

Dual feature-based and example-based explanation methods

TL;DR

Abstract

Dual feature-based and example-based explanation methods

Authors

TL;DR

Abstract

Table of Contents

Figures (13)