Towards Balancing Preference and Performance through Adaptive Personalized Explainability
Andrew Silva, Pradyumna Tambwekar, Mariah Schrum, Matthew Gombolay
TL;DR
The paper investigates how to balance the usefulness of explanations with user preferences in human-robot interaction by studying explainability modalities in a simulated autonomous-vehicle domain and proposing an adaptive personalization framework. It compares language explanations, feature-importance maps, and decision trees across population preferences and task performance, finding language explanations generally preferred and more effective, while noting modality-specific tradeoffs. Building on this, the authors develop an adaptive personalization approach that combines user preferences and task performance via sampling distributions and a neural predictor to tune explanations in real time. The empirical results show that balanced personalization improves performance relative to non-adaptive or random explanations and is not worse than using the best-known modality, suggesting a practical pathway to deploy personalized xAI in real-world HRI. The work highlights the importance of considering both user preferences and task success, offering design principles and a concrete adaptive mechanism with broad applicability in explainable AI for human-robot collaboration.
Abstract
As robots and digital assistants are deployed in the real world, these agents must be able to communicate their decision-making criteria to build trust, improve human-robot teaming, and enable collaboration. While the field of explainable artificial intelligence (xAI) has made great strides to enable such communication, these advances often assume that one xAI approach is ideally suited to each problem (e.g., decision trees to explain how to triage patients in an emergency or feature-importance maps to explain radiology reports). This fails to recognize that users have diverse experiences or preferences for interaction modalities. In this work, we present two user-studies set in a simulated autonomous vehicle (AV) domain. We investigate (1) population-level preferences for xAI and (2) personalization strategies for providing robot explanations. We find significant differences between xAI modes (language explanations, feature-importance maps, and decision trees) in both preference (p < 0.01) and performance (p < 0.05). We also observe that a participant's preferences do not always align with their performance, motivating our development of an adaptive personalization strategy to balance the two. We show that this strategy yields significant performance gains (p < 0.05), and we conclude with a discussion of our findings and implications for xAI in human-robot interactions.
