Denoising Particle Filters: Learning State Estimation with Single-Step Objectives
Lennart Röstel, Berthold Bäuml
TL;DR
This work addresses robust state estimation for robotics under partial observability and non-linear dynamics. It introduces Denoising Particle Filters (DnPF), which learns state transition and measurement models using single-step objectives and performs posterior sampling with a diffusion-based denoising process guided by a learned dynamics prior and a measurement likelihood score. A key contribution is a likelihood-constrained diffusion mechanism that keeps particles close to the data manifold, enabling modular sensor fusion without retraining. Across simulated tasks in Mujoco, DnPF achieves competitive accuracy with end-to-end baselines and shows improved robustness to distribution shifts and easy integration of external sensor models.
Abstract
Learning-based methods commonly treat state estimation in robotics as a sequence modeling problem. While this paradigm can be effective at maximizing end-to-end performance, models are often difficult to interpret and expensive to train, since training requires unrolling sequences of predictions in time. As an alternative to end-to-end trained state estimation, we propose a novel particle filtering algorithm in which models are trained from individual state transitions, fully exploiting the Markov property in robotic systems. In this framework, measurement models are learned implicitly by minimizing a denoising score matching objective. At inference, the learned denoiser is used alongside a (learned) dynamics model to approximately solve the Bayesian filtering equation at each time step, effectively guiding predicted states toward the data manifold informed by measurements. We evaluate the proposed method on challenging robotic state estimation tasks in simulation, demonstrating competitive performance compared to tuned end-to-end trained baselines. Importantly, our method offers the desirable composability of classical filtering algorithms, allowing prior information and external sensor models to be incorporated without retraining.
