Explainable Learning with Gaussian Processes

Kurt Butler; Guanchao Feng; Petar M. Djuric

Explainable Learning with Gaussian Processes

Kurt Butler, Guanchao Feng, Petar M. Djuric

TL;DR

This work takes a principled approach to defining attributions under model uncertainty, extending the existing literature, and shows that although GPR is a highly flexible and non-parametric approach, it can derive interpretable, closed-form expressions for the feature attributions.

Abstract

The field of explainable artificial intelligence (XAI) attempts to develop methods that provide insight into how complicated machine learning methods make predictions. Many methods of explanation have focused on the concept of feature attribution, a decomposition of the model's prediction into individual contributions corresponding to each input feature. In this work, we explore the problem of feature attribution in the context of Gaussian process regression (GPR). We take a principled approach to defining attributions under model uncertainty, extending the existing literature. We show that although GPR is a highly flexible and non-parametric approach, we can derive interpretable, closed-form expressions for the feature attributions. When using integrated gradients as an attribution method, we show that the attributions of a GPR model also follow a Gaussian process distribution, which quantifies the uncertainty in attribution arising from uncertainty in the model. We demonstrate, both through theory and experimentation, the versatility and robustness of this approach. We also show that, when applicable, the exact expressions for GPR attributions are both more accurate and less computationally expensive than the approximations currently used in practice. The source code for this project is freely available under MIT license at https://github.com/KurtButler/2024_attributions_paper.

Explainable Learning with Gaussian Processes

TL;DR

Abstract

Paper Structure (34 sections, 7 theorems, 105 equations, 7 figures)

This paper contains 34 sections, 7 theorems, 105 equations, 7 figures.

Introduction
Related Work
Contributions
The Feature Attribution Problem
Motivation: Bayesian Linear Regression
Integrated Gradients
Geometric Interpretation
Axiomatic Interpretation
Linear Operators
Gaussian Process Regression
Covariance Functions
Derivatives and Integrals of GPs
Spectral Approximations to GPR
Attributions in GPR Models
SE and ARD-SE Kernels
...and 19 more sections

Key Result

Lemma 1

If $F \sim \mathcal{GP}(m,k)$, where $m\in C^1(\mathbb{R}^D)$ and $k\in C^2 (\mathbb{R}^D \times \mathbb{R}^D)$, then

Figures (7)

Figure 1: A demonstration that the uncertainty in an attribution grows with the distance from the baseline, using data simulated with the model in \ref{['eq:syntheticdatamodel']}. The baseline is $\tilde{\mathbf{x}}=\mathbf{0}$ and we compute attributions for predictions along the path $\boldsymbol{\beta}(t)=t[1,1]^\top$.
Figure 2: Feature attribution for patients from the breast cancer data set. Predictions were made using a GPR model with ARD-SE kernel.
Figure 3: Feature attributions for a prediction using the Taipei Housing Data.
Figure 4: Convergence of attributions with the approximated integral, \ref{['eq:approxIG']}, to the exact attributions, as the number of partitions $L$ increases.
Figure 5: Attributions to alcohol in the Wine quality data set. As the number of frequencies used in the RFGP approximation, $M$, increases, the densities of the RFGP attributions more closely align with the exact GPR attributions. We show both individual realizations of the RFGP model (the blue and red curves), and a marginalized model which integrates over 500 independent RFGP models (the red shaded region).
...and 2 more figures

Theorems & Definitions (7)

Lemma 1: Derivatives of GPs
Lemma 2: Integral of a GP
Theorem 1: Integrated gradients preserve Gaussianity
Corollary 1: Attributions of the GPR Posterior-Predictive
Lemma 3: Scaling preserves Gaussianity
Lemma 4: Precompositions preserve Gaussianity
Proposition 1

Explainable Learning with Gaussian Processes

TL;DR

Abstract

Explainable Learning with Gaussian Processes

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (7)