DISPATCH -- Decentralized Informed Spatial Planning and Assignment of Tasks for Cooperative Heterogeneous Agents
Yao Liu, Sampad Mohanty, Elizabeth Ondula, Bhaskar Krishnamachari
TL;DR
This work tackles the fairness-efficiency tradeoff in spatial, online task allocation for heterogeneous multi-agent systems under partial observability. It fuses Eisenberg–Gale equilibrium theory with decentralized learning through a CTDE framework (EG-MARL) and a stochastic online EG-based assignment mechanism, yielding policies that approximate Pareto-efficient and envy-free allocations. Empirical results in MPE and Webots warehouse scenarios show that EG-based methods achieve strong fairness while maintaining competitive efficiency, with regret decreasing in smaller teams and approaching centralized performance as coordination scales. The study provides a principled path to fairness-aware coordination without relying on full global state, with practical implications for real-time, heterogeneous robot fleets.
Abstract
Spatial task allocation in systems such as multi-robot delivery or ride-sharing requires balancing efficiency with fair service across tasks. Greedy assignment policies that match each agent to its highest-preference or lowest-cost task can maximize efficiency but often create inequities: some tasks receive disproportionately favorable service (e.g., shorter delays or better matches), while others face long waits or poor allocations. We study fairness in heterogeneous multi-agent systems where tasks vary in preference alignment and urgency. Most existing approaches either assume centralized coordination or largely ignore fairness under partial observability. Distinct from this prior work, we establish a connection between the Eisenberg-Gale (EG) equilibrium convex program and decentralized, partially observable multi-agent learning. Building on this connection, we develop two equilibrium-informed algorithms that integrate fairness and efficiency: (i) a multi-agent reinforcement learning (MARL) framework, EG-MARL, whose training is guided by a centralized EG equilibrium assignment algorithm; and (ii) a stochastic online optimization mechanism that performs guided exploration and subset-based fair assignment as tasks are discovered. We evaluate on Multi-Agent Particle Environment (MPE) simulations across varying team sizes against centralized EG, Hungarian, and Min-Max distance baselines, and also present a Webots-based warehouse proof-of-concept with heterogeneous robots. Both methods preserve the fairness-efficiency balance of the EG solution under partial observability, with EG-MARL achieving near-centralized coordination and reduced travel distances, and the online mechanism enabling real-time allocation with competitive fairness.
