Table of Contents
Fetching ...

AgentCoord: Visually Exploring Coordination Strategy for LLM-based Multi-Agent Collaboration

Bo Pan, Jiaying Lu, Ke Wang, Li Zheng, Zhen Wen, Yingchaojie Feng, Minfeng Zhu, Wei Chen

TL;DR

The paper tackles the challenge of designing coordination strategies for LLM-based multi-agent systems, where natural language ambiguity and large text content impede efficient design. It introduces AgentCoord, a visual exploration framework that uses a structured representation and a three-stage generation method (Plan Outline, Agent Assignment, Task Process) to map user goals to executable strategies, complemented by interactive exploration and visually enhanced execution analysis. The authors validate the approach with a formal user study, showing improved strategy comprehension, exploration flexibility, and result analysis compared with two baselines. This work enables broader, more approachable design of coordinated multi-agent workflows and supports rapid prototyping and evaluation of coordination strategies.

Abstract

The potential of automatic task-solving through Large Language Model (LLM)-based multi-agent collaboration has recently garnered widespread attention from both the research community and industry. While utilizing natural language to coordinate multiple agents presents a promising avenue for democratizing agent technology for general users, designing coordination strategies remains challenging with existing coordination frameworks. This difficulty stems from the inherent ambiguity of natural language for specifying the collaboration process and the significant cognitive effort required to extract crucial information (e.g. agent relationship, task dependency, result correspondence) from a vast amount of text-form content during exploration. In this work, we present a visual exploration framework to facilitate the design of coordination strategies in multi-agent collaboration. We first establish a structured representation for LLM-based multi-agent coordination strategy to regularize the ambiguity of natural language. Based on this structure, we devise a three-stage generation method that leverages LLMs to convert a user's general goal into an executable initial coordination strategy. Users can further intervene at any stage of the generation process, utilizing LLMs and a set of interactions to explore alternative strategies. Whenever a satisfactory strategy is identified, users can commence the collaboration and examine the visually enhanced execution result. We develop AgentCoord, a prototype interactive system, and conduct a formal user study to demonstrate the feasibility and effectiveness of our approach.

AgentCoord: Visually Exploring Coordination Strategy for LLM-based Multi-Agent Collaboration

TL;DR

The paper tackles the challenge of designing coordination strategies for LLM-based multi-agent systems, where natural language ambiguity and large text content impede efficient design. It introduces AgentCoord, a visual exploration framework that uses a structured representation and a three-stage generation method (Plan Outline, Agent Assignment, Task Process) to map user goals to executable strategies, complemented by interactive exploration and visually enhanced execution analysis. The authors validate the approach with a formal user study, showing improved strategy comprehension, exploration flexibility, and result analysis compared with two baselines. This work enables broader, more approachable design of coordinated multi-agent workflows and supports rapid prototyping and evaluation of coordination strategies.

Abstract

The potential of automatic task-solving through Large Language Model (LLM)-based multi-agent collaboration has recently garnered widespread attention from both the research community and industry. While utilizing natural language to coordinate multiple agents presents a promising avenue for democratizing agent technology for general users, designing coordination strategies remains challenging with existing coordination frameworks. This difficulty stems from the inherent ambiguity of natural language for specifying the collaboration process and the significant cognitive effort required to extract crucial information (e.g. agent relationship, task dependency, result correspondence) from a vast amount of text-form content during exploration. In this work, we present a visual exploration framework to facilitate the design of coordination strategies in multi-agent collaboration. We first establish a structured representation for LLM-based multi-agent coordination strategy to regularize the ambiguity of natural language. Based on this structure, we devise a three-stage generation method that leverages LLMs to convert a user's general goal into an executable initial coordination strategy. Users can further intervene at any stage of the generation process, utilizing LLMs and a set of interactions to explore alternative strategies. Whenever a satisfactory strategy is identified, users can commence the collaboration and examine the visually enhanced execution result. We develop AgentCoord, a prototype interactive system, and conduct a formal user study to demonstrate the feasibility and effectiveness of our approach.
Paper Structure (37 sections, 3 equations, 6 figures)

This paper contains 37 sections, 3 equations, 6 figures.

Figures (6)

  • Figure 1: System interface of AgentCoord. The first three views (Plan Outline View, Agent Assignment View, Task Process View) correspond to the three-stage coordination strategy generation process while the last view presents the execution result.
  • Figure 2: An illustrative example of plan outline exploration.
  • Figure 3: An illustrative example of agent assignment exploration.
  • Figure 4: An illustrative example of task process exploration.
  • Figure 5: The results of the questionnaire regarding our system’s effectiveness and usability.
  • ...and 1 more figures