Overseeing Agents Without Constant Oversight: Challenges and Opportunities
Madeleine Grunde-McLaughlin, Hussein Mozannar, Maya Murad, Jingya Chen, Saleema Amershi, Adam Fourney
TL;DR
The paper investigates how to oversee agentic AI without constant oversight by examining trace presentation in Computer Use Agents. Through a formative study, three design probes, and a controlled experiment, it demonstrates that verbose, step-by-step traces hinder error detection, while an outcome-oriented, specification-based interface can reduce error-finding time but may inflate confidence without improving final accuracy. The findings highlight challenges in conveying agent process provenance, underspecified goals, and subjective notions of correctness, and they point to integrating verification into running workflows with improved navigation. Practically, the work advances trace design by balancing process visibility with outcome checks and by emphasizing navigation, assumptions, and binding between requirements and executed steps. These insights have implications for designing safer, more trustworthy human-AI collaboration in multi-step agentic systems.
Abstract
To enable human oversight, agentic AI systems often provide a trace of reasoning and action steps. Designing traces to have an informative, but not overwhelming, level of detail remains a critical challenge. In three user studies on a Computer User Agent, we investigate the utility of basic action traces for verification, explore three alternatives via design probes, and test a novel interface's impact on error finding in question-answering tasks. As expected, we find that current practices are cumbersome, limiting their efficacy. Conversely, our proposed design reduced the time participants spent finding errors. However, although participants reported higher levels of confidence in their decisions, their final accuracy was not meaningfully improved. To this end, our study surfaces challenges for human verification of agentic systems, including managing built-in assumptions, users' subjective and changing correctness criteria, and the shortcomings, yet importance, of communicating the agent's process.
