Bayesian Decentralized Decision-making for Multi-Robot Systems: Sample-efficient Estimation of Event Rates
Gabriel Aguirre, Simay Atasoy Bingöl, Heiko Hamann, Jonas Kuckling
TL;DR
The paper tackles identifying the safer of two areas under unknown stochastic hazards in a multi-robot swarm. It introduces a decentralized Bayesian framework where each robot privately models interarrival times of hazardous events using a Weibull-exponential assumption with an inverse-gamma conjugate prior, and reaches consensus via a DMMD-based mechanism. Results show that DMMD-driven strategies can achieve correct and faster consensus with reduced exposure compared to a baseline, though belief sharing can induce faster but sometimes biased decisions. The work demonstrates a path toward risk-aware, sample-efficient exploration benchmarks for dynamic environments and suggests directions for handling censored data and extending to multiple options. Overall, it offers a principled approach to combining Bayesian inference with swarm-level majority dynamics for safe autonomous exploration.
Abstract
Effective collective decision-making in swarm robotics often requires balancing exploration, communication and individual uncertainty estimation, especially in hazardous environments where direct measurements are limited or costly. We propose a decentralized Bayesian framework that enables a swarm of simple robots to identify the safer of two areas, each characterized by an unknown rate of hazardous events governed by a Poisson process. Robots employ a conjugate prior to gradually predict the times between events and derive confidence estimates to adapt their behavior. Our simulation results show that the robot swarm consistently chooses the correct area while reducing exposure to hazardous events by being sample-efficient. Compared to baseline heuristics, our proposed approach shows better performance in terms of safety and speed of convergence. The proposed scenario has potential to extend the current set of benchmarks in collective decision-making and our method has applications in adaptive risk-aware sampling and exploration in hazardous, dynamic environments.
