Magentic-UI: Towards Human-in-the-loop Agentic Systems

Hussein Mozannar; Gagan Bansal; Cheng Tan; Adam Fourney; Victor Dibia; Jingya Chen; Jack Gerrits; Tyler Payne; Matheus Kunzler Maldaner; Madeleine Grunde-McLaughlin; Eric Zhu; Griffin Bassman; Jacob Alber; Peter Chang; Ricky Loynd; Friederike Niedtner; Ece Kamar; Maya Murad; Rafah Hosn; Saleema Amershi

TL;DR

Magentic-UI addresses the safety and productivity gaps of autonomous LLM agents by providing an open-source, human-in-the-loop interface with a flexible multi-agent architecture. It introduces six interaction mechanisms (co-planning, co-tasking, action guards, verification, memory, and multi-tasking) and demonstrates how these enable safe, low-cost collaboration between humans and agents across web browsing, coding, and file tasks. Through autonomous, simulated-user, and qualitative evaluations on agentic benchmarks, it shows potential to improve task success and user oversight while highlighting remaining challenges in latency, plan predictability, and safety. The work offers a practical platform for researching, comparing, and extending human-agent collaboration strategies in realistic computer-use workflows.

Abstract

AI agents powered by large language models are increasingly capable of autonomously completing complex, multi-step tasks using external tools. Yet they still fall short of human-level performance in most domains, including computer use, software development, and research. Their growing autonomy and ability to interact with the outside world also introduce safety and security risks, including potentially misaligned actions and adversarial manipulation. We argue that human-in-the-loop agentic systems offer a promising path forward, combining human oversight and control with AI efficiency to unlock productivity from imperfect systems. We introduce Magentic-UI, an open-source web interface for developing and studying human-agent interaction. Built on a flexible multi-agent architecture, Magentic-UI supports web browsing, code execution, and file manipulation, and can be extended with diverse tools via Model Context Protocol (MCP). Moreover, Magentic-UI presents six interaction mechanisms for enabling effective, low-cost human involvement: co-planning, co-tasking, multi-tasking, action guards, verification, and long-term memory. We evaluate Magentic-UI across four dimensions: autonomous task completion on agentic benchmarks, simulated user testing of its interaction capabilities, qualitative studies with real users, and targeted safety assessments. Our findings highlight Magentic-UI's potential to advance safe and efficient human-agent collaboration.
