Multi-Agent Strategy Explanations for Human-Robot Collaboration
Ravi Pandya, Michelle Zhao, Changliu Liu, Reid Simmons, Henny Admoni
TL;DR
This work addresses the challenge of coordinating human-robot teams in settings with multiple Nash equilibria by introducing strategy-conditioned landmarks as a visual and textual explanation mechanism. It formalizes the problem as a HiP-MDP with a latent strategy space $\mathcal{Z}$, then clusters strategies, derives landmark states $S_{land}$, and generates explanations via an LLM to enable proactive strategy alignment. The method is evaluated in two domains, Collaborative Maze and Social Navigation, using Co-GAIL and ILQGames to obtain diverse strategy clusters; user studies show that landmark-based explanations improve zero-shot coordination and strategy exploration, with video-only baselines being less effective. The approach demonstrates a practical path toward proactive, explainable multi-agent collaboration, with future work focusing on scalability, home and real-world deployment, and safety considerations in physical robotics.
Abstract
As robots are deployed in human spaces, it is important that they are able to coordinate their actions with the people around them. Part of such coordination involves ensuring that people have a good understanding of how a robot will act in the environment. This can be achieved through explanations of the robot's policy. Much prior work in explainable AI and RL focuses on generating explanations for single-agent policies, but little has been explored in generating explanations for collaborative policies. In this work, we investigate how to generate multi-agent strategy explanations for human-robot collaboration. We formulate the problem using a generic multi-agent planner, show how to generate visual explanations through strategy-conditioned landmark states and generate textual explanations by giving the landmarks to an LLM. Through a user study, we find that when presented with explanations from our proposed framework, users are able to better explore the full space of strategies and collaborate more efficiently with new robot partners.
