Challenges in Human-Agent Communication
Gagan Bansal, Jennifer Wortman Vaughan, Saleema Amershi, Eric Horvitz, Adam Fourney, Hussein Mozannar, Victor Dibia, Daniel S. Weld
TL;DR
This work frames human–agent communication around common ground, grounded in cognitive science and HCI, to address twelve challenges arising from modern, tool-using base models. It classifies these into three broad families (A1–A5, U1–U3, X1–X4) and provides concrete examples across pre-, during-, and post-action phases, illustrating both goals-to-be-achieved and capabilities-to-be-disclosed. The contribution is a structured taxonomy of information flow challenges, with analysis of corresponding design directions and open research questions, emphasizing transparency, controllability, and iterative verification. The paper advocates cross-disciplinary effort to develop patterns, guidelines, and interfaces that sustain shared understanding in high-stakes, multi-agent workflows, ultimately improving safety and user trust in autonomous systems.
Abstract
Remarkable advancements in modern generative foundation models have enabled the development of sophisticated and highly capable autonomous agents that can observe their environment, invoke tools, and communicate with other agents to solve problems. Although such agents can communicate with users through natural language, their complexity and wide-ranging failure modes present novel challenges for human-AI interaction. Building on prior research and informed by a communication grounding perspective, we contribute to the study of \emph{human-agent communication} by identifying and analyzing twelve key communication challenges that these systems pose. These include challenges in conveying information from the agent to the user, challenges in enabling the user to convey information to the agent, and overarching challenges that need to be considered across all human-agent communication. We illustrate each challenge through concrete examples and identify open directions of research. Our findings provide insights into critical gaps in human-agent communication research and serve as an urgent call for new design patterns, principles, and guidelines to support transparency and control in these systems.
