Leveraging Counterfactual Paths for Contrastive Explanations of POMDP Policies
Benjamin Kraske, Zakariya Laouar, Zachary Sunberg
TL;DR
This work tackles the problem of making POMDP policies transparent by using user-provided counterfactual paths to generate contrastive explanations. It defines feature occupancy and feature expectations to compare an open-loop counterfactual policy against a near-optimal SAR POMDP policy, enabling interpretable distinctions in terms of problem objectives and constraints. Using SARSOP to obtain an approximate optimal policy, the approach demonstrates how differences in feature expectations reveal why certain paths are chosen or avoided, especially under uncertainty and resource limits. Through two SAR case studies, the paper shows how observable versus unobservable objectives and battery constraints shape explanations, contributing to trust and effective human–robot collaboration in autonomous search tasks.
Abstract
As humans come to rely on autonomous systems more, ensuring the transparency of such systems is important to their continued adoption. Explainable Artificial Intelligence (XAI) aims to reduce confusion and foster trust in systems by providing explanations of agent behavior. Partially observable Markov decision processes (POMDPs) provide a flexible framework capable of reasoning over transition and state uncertainty, while also being amenable to explanation. This work investigates the use of user-provided counterfactuals to generate contrastive explanations of POMDP policies. Feature expectations are used as a means of contrasting the performance of these policies. We demonstrate our approach in a Search and Rescue (SAR) setting. We analyze and discuss the associated challenges through two case studies.
