A Computationally Efficient Maximum A Posteriori Sequence Estimation via Stein Variational Inference
Min-Won Seo, Solmaz S. Kia
TL;DR
This work tackles MAP sequence estimation under multimodal posteriors in robotics by coupling SVGD with a Viterbi-style DP in a sequential variational framework. A two-stage approach first builds a compact, particle-based discretization of time-evolving posteriors via SVGD, then performs forward DP and backtracking to recover a globally optimal MAP trajectory, with theoretical guarantees that ELBO maximization aligns with MAP trajectory recovery as the mollifier vanishes. The method achieves substantial accuracy and robustness improvements across nonlinear, data-association, range-localization, and high-dimensional manipulation tasks while using far fewer particles than traditional particle- filter–based MAP methods, and it benefits from easy parallelization on modern hardware. The results suggest Stein-MAP-Seq as both a standalone MAP-Seq estimator and a practical initialization front-end for batch MAP optimization, enabling reliable trajectory recovery in challenging multimodal settings with real-time potential.
Abstract
State estimation in robotic systems presents significant challenges, particularly due to the prevalence of multimodal posterior distributions in real-world scenarios. One effective strategy for handling such complexity is to compute maximum a posteriori (MAP) sequences over a discretized or sampled state space, which enables a concise representation of the most likely state trajectory. However, this approach often incurs substantial computational costs, especially in high-dimensional settings. In this article, we propose a novel MAP sequence estimation method, Stein-MAP-Seq, which effectively addresses multimodality while substantially reducing computational and memory overhead. Our key contribution is a sequential variational inference framework that captures temporal dependencies in dynamical system models and integrates Stein variational gradient descent (SVGD) into a Viterbi-style dynamic programming algorithm, enabling computationally efficient MAP sequence estimation. This integration allows the method to focus computational effort on MAP-consistent modes rather than exhaustively exploring the entire state space. Stein-MAP-Seq inherits the parallelism and mode-seeking behavior of SVGD, allowing particle updates to be efficiently executed on parallel hardware and significantly reducing the number of trajectory candidates required for MAP-sequence recursion compared to conventional methods that rely on hundreds to thousands of particles. We validate the proposed approach on a range of highly multimodal scenarios, including nonlinear dynamics with ambiguous observations, unknown data association with outliers, range-only localization under temporary unobservability, and high-dimensional robotic manipulators. Experimental results demonstrate substantial improvements in estimation accuracy and robustness to multimodality over existing estimation methods.
