Perception with Guarantees: Certified Pose Estimation via Reachability Analysis
Tobias Ladner, Yasser Shoukry, Matthias Althoff
TL;DR
This work tackles the challenge of obtaining formally guaranteed 3D pose estimates from vision data in safety-critical settings by combining reachability-based image enclosures with formal neural-network verification. By modeling uncertainties with matrix polynomial zonotopes and leveraging a preprocessing-online refinement pipeline, the method yields a certified pose set $\mathcal{E}^*$ that contains the true pose $\xi^*$ given an image $I^*$ and a known target geometry $\mathcal{T}$. The key contributions are (i) sound image enclosure from uncertain poses, (ii) a two-phase image-to-pose procedure with offline candidate precomputation and online refinement via witness pixels and preimage enclosures, and (iii) extensive experiments on synthetic and real data demonstrating tight certificates and practical runtime. The approach advances safety guarantees for vision-based perception in CPS, enabling certified localization even when external sensors or services are unreliable, and lays groundwork for real-time implementations with further optimization and hardware acceleration.
Abstract
Agents in cyber-physical systems are increasingly entrusted with safety-critical tasks. Ensuring safety of these agents often requires localizing the pose for subsequent actions. Pose estimates can, e.g., be obtained from various combinations of lidar sensors, cameras, and external services such as GPS. Crucially, in safety-critical domains, a rough estimate is insufficient to formally determine safety, i.e., guaranteeing safety even in the worst-case scenario, and external services might additionally not be trustworthy. We address this problem by presenting a certified pose estimation in 3D solely from a camera image and a well-known target geometry. This is realized by formally bounding the pose, which is computed by leveraging recent results from reachability analysis and formal neural network verification. Our experiments demonstrate that our approach efficiently and accurately localizes agents in both synthetic and real-world experiments.
