BEE: Metric-Adapted Explanations via Baseline Exploration-Exploitation
Oren Barkan, Yehonatan Elisha, Jonathan Weill, Noam Koenigstein
TL;DR
This work addresses the lack of a universal evaluation metric and baseline representation in explainable AI by introducing Baseline Exploration-Exploitation (BEE). BEE models the baseline as a random tensor drawn from a learned mixture of baseline distributions, and optimizes that mixture through a contextual exploration-exploitation procedure so that explanations are tailored to a given metric. Explanations are computed by path integration over intermediate representations, which yields multiple maps across layers; the best-performing map is then selected for each metric. The method supports both pretrained use and finetuning at inference time. In ImageNet experiments with CNN and ViT backbones, BEE achieves state-of-the-art performance on a broad set of objective metrics, demonstrating strong generalization and the ability to adapt explanations to diverse evaluation criteria.
Abstract
Two prominent challenges in explainability research involve 1) the nuanced evaluation of explanations and 2) the modeling of missing information through baseline representations. The existing literature introduces diverse evaluation metrics, each scrutinizing the quality of explanations through a distinct lens. Likewise, various baseline representations have been proposed, each modeling the notion of missingness differently. Yet a consensus on the ultimate evaluation metric and baseline representation remains elusive. This work acknowledges the diversity in explanation metrics and baselines, and demonstrates that different metrics prefer different explanation maps, which in turn result from the use of different baseline representations and distributions. To address the diversity in metrics and accommodate the variety of baseline representations in a unified manner, we propose Baseline Exploration-Exploitation (BEE), a path-integration method that introduces randomness into the integration process by modeling the baseline as a learned random tensor. This tensor follows a learned mixture of baseline distributions, optimized through a contextual exploration-exploitation procedure to enhance performance on the specific metric of interest. By resampling the baseline from the learned distribution, BEE generates a comprehensive set of explanation maps, from which the best-performing map for the given metric can be selected. Extensive evaluations across various model architectures showcase the superior performance of BEE in comparison to state-of-the-art explanation methods on a variety of objective evaluation metrics.
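The loop described in the abstract (sample a baseline, integrate along a path, score the resulting map on a metric, and keep the best map) can be illustrated with a small NumPy sketch. This is not the authors' implementation, and several parts are assumptions made for illustration: the toy sigmoid model, the three baseline samplers, the `deletion_drop` metric, and all function names are hypothetical; integration is done at the input rather than over intermediate representations; and a plain (non-contextual) epsilon-greedy bandit stands in for the contextual exploration-exploitation procedure.

```python
import numpy as np

def model(x, w):
    """Toy differentiable classifier (assumption): sigmoid of a linear score."""
    return 1.0 / (1.0 + np.exp(-x @ w))

def model_grad(x, w):
    """Analytic gradient of the toy model with respect to its input."""
    p = model(x, w)
    return p * (1.0 - p) * w

def integrated_gradients(x, baseline, w, steps=32):
    """Midpoint Riemann approximation of the path integral from baseline to x."""
    alphas = (np.arange(steps) + 0.5) / steps
    points = baseline + alphas[:, None] * (x - baseline)
    grads = np.stack([model_grad(p, w) for p in points])
    return (x - baseline) * grads.mean(axis=0)

# Hypothetical baseline distributions; each one is an arm of the bandit.
BASELINE_SAMPLERS = {
    "zeros": lambda x, rng: np.zeros_like(x),
    "gaussian": lambda x, rng: rng.normal(0.0, 1.0, size=x.shape),
    "uniform": lambda x, rng: rng.uniform(x.min(), x.max(), size=x.shape),
}

def deletion_drop(x, attribution, w, k=3):
    """Illustrative metric: output drop after zeroing the k top-attributed features."""
    top = np.argsort(attribution)[-k:]
    x_del = x.copy()
    x_del[top] = 0.0
    return model(x, w) - model(x_del, w)

def explore_exploit(x, w, metric, rounds=60, epsilon=0.2, seed=0):
    """Epsilon-greedy search over baseline distributions, keeping the best map."""
    rng = np.random.default_rng(seed)
    names = list(BASELINE_SAMPLERS)
    counts = np.zeros(len(names))
    means = np.zeros(len(names))
    best_score, best_map, best_arm = -np.inf, None, None
    for t in range(rounds):
        if t < len(names):
            arm = t                                  # try every arm once
        elif rng.random() < epsilon:
            arm = int(rng.integers(len(names)))      # explore
        else:
            arm = int(np.argmax(means))              # exploit
        baseline = BASELINE_SAMPLERS[names[arm]](x, rng)
        attribution = integrated_gradients(x, baseline, w)
        score = metric(x, attribution, w)
        counts[arm] += 1
        means[arm] += (score - means[arm]) / counts[arm]  # running mean reward
        if score > best_score:
            best_score, best_map, best_arm = score, attribution, names[arm]
    return best_map, best_score, best_arm, dict(zip(names, means))

# Usage on made-up data.
x = np.array([0.8, -0.4, 1.2, 0.3, 0.9, -1.0])
w = np.array([1.0, -2.0, 0.5, 0.0, 1.5, -0.5])
best_map, best_score, best_arm, arm_means = explore_exploit(x, w, deletion_drop)
print(best_arm, round(float(best_score), 4))
print(arm_means)
```

Swapping `deletion_drop` for another metric function changes which baseline distribution the bandit favors, which is the behavior the abstract refers to when it says that different metrics prefer different explanation maps.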
