AgentRouter: A Knowledge-Graph-Guided LLM Router for Collaborative Multi-Agent Question Answering
Zheyuan Zhang, Kaiwen Shi, Zhengqing Yuan, Zehong Wang, Tianyi Ma, Keerthiram Murugesan, Vincent Galassi, Chuxu Zhang, Yanfang Ye
TL;DR
AgentRouter tackles the problem of selecting and coordinating among heterogeneous LLM-based agents for question answering by formulating it as a knowledge-graph-guided routing task. It constructs a knowledge graph that jointly encodes queries, contextual entities, and candidate agents, and trains a type-aware RouterGNN to produce a task-specific distribution over agents, using KL divergence to align routing with empirical performance $p^*(a|q)$. The final answer is obtained by a weighted fusion $\\hat{y}(q)=\phi(\{y_a(q),p_\theta(a|q,\mathcal{G})\})$, enabling principled collaboration that leverages complementary strengths. Extensive experiments across multi-hop and direct QA benchmarks show AgentRouter outperforms single agents and prior routing baselines, with robust generalization across backbones and tasks, and reveal the value of contextual graph signals and soft supervision for adaptive collaboration.
Abstract
Large language models (LLMs) and agent-based frameworks have advanced rapidly, enabling diverse applications. Yet, with the proliferation of models and agentic strategies, practitioners face substantial uncertainty in selecting the best configuration for a downstream task. Prior studies show that different agents and backbones exhibit complementary strengths, and that larger models are not always superior, underscoring the need for adaptive routing mechanisms. Existing approaches to agent routing, however, often emphasize cost efficiency while overlooking the fine-grained contextual and relational structure inherent in QA tasks. In this paper, we propose tAgentRouter, a framework that formulates multi-agent QA as a knowledge-graph-guided routing problem supervised by empirical performance signals. Specifically, we convert QA instance into a knowledge graph that jointly encodes queries, contextual entities, and agents, and then train a heterogeneous graph neural network (GNN) to propagate information across node types and produce task-aware routing distributions over agents. By leveraging soft supervision and weighted aggregation of agent outputs, AgentRouter learns principled collaboration schemes that capture the complementary strengths of diverse agents. Extensive experiments demonstrate that our framework consistently outperforms single-agent and ensemble baselines, while generalizing across benchmarks and LLM backbones. These results highlight the effectiveness and robustness of graph-supervised multi-agent routing for question answering.
