Goal inference with Rao-Blackwellized Particle Filters
Yixuan Wang, Dan P. Guralnik, Warren E. Dixon
TL;DR
The paper advances intent inference for goal-directed motion by formulating a Rao-Blackwellized Particle Filter that leverages a conditionally linear–Gaussian structure to marginalize the state and sample only the intent parameters. It introduces density-based representations of intent and two estimators (complete and reduced), along with computable KL-divergence bounds that quantify information leakage to an adversary. Theoretical bounds on the difference between estimators are established, and numerical experiments in a 2D plane demonstrate rapid, robust recovery of the true goal with the reduced estimator performing nearly as well as the complete one. The work provides a principled framework for evaluating intent inference and motivates designing obfuscating controllers under KL-based leakage constraints for safer autonomy and privacy-aware systems.
Abstract
Inferring the eventual goal of a mobile agent from noisy observations of its trajectory is a fundamental estimation problem. We initiate the study of such intent inference using a variant of a Rao-Blackwellized Particle Filter (RBPF), subject to the assumption that the agent's intent manifests through closed-loop behavior with a state-of-the-art provable practical stability property. Leveraging the assumed closed-form agent dynamics, the RBPF analytically marginalizes the linear-Gaussian substructure and updates particle weights only, improving sample efficiency over a standard particle filter. Two difference estimators are introduced: a Gaussian mixture model using the RBPF weights and a reduced version confining the mixture to the effective sample. We quantify how well the adversary can recover the agent's intent using information-theoretic leakage metrics and provide computable lower bounds on the Kullback-Leibler (KL) divergence between the true intent distribution and RBPF estimates via Gaussian-mixture KL bounds. We also provide a bound on the difference in performance between the two estimators, highlighting the fact that the reduced estimator performs almost as well as the complete one. Experiments illustrate fast and accurate intent recovery for compliant agents, motivating future work on designing intent-obfuscating controllers.
