Bayesian Optimization Framework for Efficient Fleet Design in Autonomous Multi-Robot Exploration
David Molina Concha, Jiping Li, Haoran Yin, Kyeonghyeon Park, Hyun-Rok Lee, Taesik Lee, Dhruv Sirohi, Chi-Guhn Lee
TL;DR
This work tackles the problem of designing heterogeneous autonomous robot fleets for efficient exploration under expensive, high-dimensional evaluations. It introduces BOFD, a bi-level framework that uses multi-objective Bayesian optimization to search discrete fleet compositions and a lower-level MARL method (MADDPG) to evaluate performance, while leveraging a known linear acquisition cost. A sub-linear regret bound is established to guarantee robustness, and extensive synthetic and simulated experiments show that BOFD achieves near-optimal fleet designs with far fewer evaluations than baselines. The approach promises substantial practical impact by accelerating fleet design and reducing computational costs in complex autonomous multi-robot systems.
Abstract
This study addresses the challenge of fleet design optimization in the context of heterogeneous multi-robot fleets, aiming to obtain feasible designs that balance performance and costs. In the domain of autonomous multi-robot exploration, reinforcement learning agents play a central role, offering adaptability to complex terrains and facilitating collaboration among robots. However, modifying the fleet composition results in changes in the learned behavior, and training multi-robot systems using multi-agent reinforcement learning is expensive. Therefore, an exhaustive evaluation of each potential fleet design is infeasible. To tackle these hurdles, we introduce Bayesian Optimization for Fleet Design (BOFD), a framework leveraging multi-objective Bayesian Optimization to explore fleets on the Pareto front of performance and cost while accounting for uncertainty in the design space. Moreover, we establish a sub-linear bound for cumulative regret, supporting BOFD's robustness and efficacy. Extensive benchmark experiments in synthetic and simulated environments demonstrate the superiority of our framework over state-of-the-art methods, achieving efficient fleet designs with minimal fleet evaluations.
