Practical Attribution Guidance for Rashomon Sets
Sichao Li, Amanda S. Barnard, Quanling Deng
TL;DR
This work addresses the practical challenge of exploring Rashomon sets—collections of near-optimal models with potentially conflicting explanations—by formalizing two core axioms: generalizability and implementation sparsity. It introduces the General Rashomon Subset Sampling (GRSS) framework, which leverages an ε-subgradient optimization approach to sample a generalized Rashomon set represented via input- or output-activated transformations of the data, along with a model- and task-agnostic generalized feature-attribution mechanism. The authors propose metrics such as SER, dist_inf, and FER to quantify search efficiency and attribution diversity, and they validate the framework on synthetic and real-world datasets, showing that it can recover meaningful, non-redundant attributions across model families and epsilons. The approach offers a practical, theory-backed pathway to interpretable AI that accounts for model-interpretation variability, enabling more robust explanations and decision-making in complex tasks.
Abstract
Different prediction models might perform equally well (Rashomon set) in the same task, but offer conflicting interpretations and conclusions about the data. The Rashomon effect in the context of Explainable AI (XAI) has been recognized as a critical factor. Although the Rashomon set has been introduced and studied in various contexts, its practical application is at its infancy stage and lacks adequate guidance and evaluation. We study the problem of the Rashomon set sampling from a practical viewpoint and identify two fundamental axioms - generalizability and implementation sparsity that exploring methods ought to satisfy in practical usage. These two axioms are not satisfied by most known attribution methods, which we consider to be a fundamental weakness. We use the norms to guide the design of an $ε$-subgradient-based sampling method. We apply this method to a fundamental mathematical problem as a proof of concept and to a set of practical datasets to demonstrate its ability compared with existing sampling methods.
