Explainable Human-AI Interaction: A Planning Perspective
Sarath Sreedharan, Anagha Kulkarni, Subbarao Kambhampati
TL;DR
This work surveys a planning-centric framework for explainable human-AI interaction, emphasizing how AI agents can form and use mental models of humans to generate explicable, legible, and predictable behavior. It introduces model-reconciliation explanations, and model-space search techniques to produce minimal, contrastive explanations that align human and machine plans, while also exploring obfuscation and deception in adversarial settings. The text covers mathematical formalisms for COPP, explicable and legible planning, design-of-environments, and balanced planning that integrates communication costs with plan optimality. It culminates with broad applications in collaborative decision-making, human-robot teams, and actionable guidance for building human-centric AI, including trust and longitudinal interaction, learning human models, and safety considerations for advanced AI systems.
Abstract
From its inception, AI has had a rather ambivalent relationship with humans -- swinging between their augmentation and replacement. Now, as AI technologies enter our everyday lives at an ever increasing pace, there is a greater need for AI systems to work synergistically with humans. One critical requirement for such synergistic human-AI interaction is that the AI systems be explainable to the humans in the loop. To do this effectively, AI agents need to go beyond planning with their own models of the world, and take into account the mental model of the human in the loop. Drawing from several years of research in our lab, we will discuss how the AI agent can use these mental models to either conform to human expectations, or change those expectations through explanatory communication. While the main focus of the book is on cooperative scenarios, we will point out how the same mental models can be used for obfuscation and deception. Although the book is primarily driven by our own research in these areas, in every chapter, we will provide ample connections to relevant research from other groups.
