Locally-Minimal Probabilistic Explanations
Yacine Izza, Kuldeep S. Meel, Joao Marques-Silva
TL;DR
This work tackles the challenge of producing rigorous yet compact explanations for complex ML models by focusing on locally-minimal probabilistic abductive explanations (LmPAXp). It formalizes probabilistic AXps (PAXp) and their locally-minimal variant, discusses their non-monotonic nature, and introduces efficient approximate computation methods using either PAC-based approximate model counting or Monte-Carlo sampling. The authors develop logic-based encodings for BNNs and RFs to support these explanations and propose a deletion-based algorithm to extract small LmPAXps with practical guarantees. Empirical results on RFs and BNNs show substantial reductions in explanation size with high precision, demonstrating the practicality of the approach for scalable XAI in real-world classifiers. The work highlights a trade-off between explanation compactness and probabilistic precision, and points to future integration with SMT to broaden applicability beyondBoolean domains.
Abstract
Explainable Artificial Intelligence (XAI) is widely regarding as a cornerstone of trustworthy AI. Unfortunately, most work on XAI offers no guarantees of rigor. In high-stakes domains, e.g. uses of AI that impact humans, the lack of rigor of explanations can have disastrous consequences. Formal abductive explanations offer crucial guarantees of rigor and so are of interest in high-stakes uses of machine learning (ML). One drawback of abductive explanations is explanation size, justified by the cognitive limits of human decision-makers. Probabilistic abductive explanations (PAXps) address this limitation, but their theoretical and practical complexity makes their exact computation most often unrealistic. This paper proposes novel efficient algorithms for the computation of locally-minimal PXAps, which offer high-quality approximations of PXAps in practice. The experimental results demonstrate the practical efficiency of the proposed algorithms.
