Stein Variational Belief Propagation for Multi-Robot Coordination
Jana Pavlasek, Joshua Jing Zhi Mah, Ruihan Xu, Odest Chadwicke Jenkins, Fabio Ramos
TL;DR
The paper tackles decentralized multi-robot coordination under uncertainty by modeling the robot swarm as a Markov Random Field and performing nonparametric marginal inference with Stein Variational Belief Propagation (SVBP). SVBP combines Stein Variational Gradient Descent with Particle Belief Propagation to maintain multi-modal marginal beliefs at each node, enabling parallelizable, GPU-friendly updates. Empirical results in both perception and planning tasks show SVBP better preserves true marginals and yields more robust planning, reducing deadlocks and producing smoother trajectories compared to PBP and GaBP, with successful real-robot demonstrations. The approach offers a scalable, decentralized alternative for high-dimensional, multi-agent coordination, robust to noise and asynchronous communication, while acknowledging computational scaling with graph degree and synchronization challenges as future work.
Abstract
Decentralized coordination for multi-robot systems involves planning in challenging, high-dimensional spaces. The planning problem is particularly challenging in the presence of obstacles and different sources of uncertainty such as inaccurate dynamic models and sensor noise. In this paper, we introduce Stein Variational Belief Propagation (SVBP), a novel algorithm for performing inference over nonparametric marginal distributions of nodes in a graph. We apply SVBP to multi-robot coordination by modelling a robot swarm as a graphical model and performing inference for each robot. We demonstrate our algorithm on a simulated multi-robot perception task, and on a multi-robot planning task within a Model-Predictive Control (MPC) framework, on both simulated and real-world mobile robots. Our experiments show that SVBP represents multi-modal distributions better than sampling-based or Gaussian baselines, resulting in improved performance on perception and planning tasks. Furthermore, we show that SVBP's ability to represent diverse trajectories for decentralized multi-robot planning makes it less prone to deadlock scenarios than leading baselines.
