Axiomatic Explainer Globalness via Optimal Transport
Davin Hill, Josh Bone, Aria Masoomi, Max Torop, Jennifer Dy
TL;DR
The paper tackles the challenge of evaluating explainers by introducing Wasserstein Globalness ($G_p$), a $p$-Wasserstein distance-based metric that quantifies the diversity of explanations a given explainer produces over a dataset. It defines a formal axiomatic framework with six properties, proves that $G_p$ satisfies these properties, and provides finite-sample bounds for its empirical estimator $\\hat{G}_p$, enabling practical use with discrete or continuous explanations and flexible distance metrics $d_{\\mathcal{E}}$. The approach is explainer-agnostic and includes normalization to a [0,1] scale, making it comparable across methods and domains; it can utilize efficient approximations like Sliced Wasserstein or Sinkhorn. Empirical results on image, tabular, and synthetic data show that WG complements faithfulness metrics by capturing explanation diversity, aiding in the selection of lower-complexity yet faithful explainers, and revealing when explanations truly reflect underlying data structure.
Abstract
Explainability methods are often challenging to evaluate and compare. With a multitude of explainers available, practitioners must often compare and select explainers based on quantitative evaluation metrics. One particular differentiator between explainers is the diversity of explanations for a given dataset; i.e. whether all explanations are identical, unique and uniformly distributed, or somewhere between these two extremes. In this work, we define a complexity measure for explainers, globalness, which enables deeper understanding of the distribution of explanations produced by feature attribution and feature selection methods for a given dataset. We establish the axiomatic properties that any such measure should possess and prove that our proposed measure, Wasserstein Globalness, meets these criteria. We validate the utility of Wasserstein Globalness using image, tabular, and synthetic datasets, empirically showing that it both facilitates meaningful comparison between explainers and improves the selection process for explainability methods.
