SAFE-SMART: Safety Analysis and Formal Evaluation using STL Metrics for Autonomous RoboTs

Kristy Sakano; Jianyu An; Dinesh Manocha; Huan Xu

SAFE-SMART: Safety Analysis and Formal Evaluation using STL Metrics for Autonomous RoboTs

Kristy Sakano, Jianyu An, Dinesh Manocha, Huan Xu

TL;DR

This work tackles the challenge of safely evaluating learning-based, black-box autonomous robots by introducing SAFE-SMART, a regulator-driven framework that translates human safety rules into STL specifications and uses rollout traces verified by TeLEx to compute TRV and LRV. The approach is model-agnostic and provides actionable feedback to designers for iterative retraining, demonstrated on two distinct domains: a virtual Mario Kart driving task and TurtleBot3 obstacle navigation, including real-world validation. Results show significant improvements in safety-rule adherence after retraining, with statistically robust gains across all rules in both scenarios and evidence of practical impact through real-world demonstrations. The study highlights the potential and limitations of regulator-guided STL verification for scalable, formal safety guarantees in learning-based autonomous robots.

Abstract

We present a novel, regulator-driven approach for post hoc safety evaluation of learning-based, black-box autonomous mobile robots, ensuring ongoing compliance with evolving, human-defined safety rules. In our iterative workflow, human safety requirements are translated by regulators into Signal Temporal Logic (STL) specifications. Rollout traces from the black-box model are externally verified for compliance, yielding quantitative safety metrics, Total Robustness Value (TRV) and Largest Robustness Value (LRV), which measure average and worst-case specification adherence. These metrics inform targeted retraining and iterative improvement by model designers. We apply our method across two different applications: a virtual driving scenario and an autonomous mobile robot navigating a complex environment, and observe statistically significant improvements across both scenarios. In the virtual driving scenario, we see a 177% increase in traces adhering to the simulation speed limit, a 1138% increase in traces minimizing off-road driving, and a 16% increase in traces successfully reaching the goal within the time limit. In the autonomous navigation scenario, there is a 300% increase in traces avoiding sharp turns, a 200% increase in traces reaching the goal within the time limit, and a 49% increase in traces minimizing time spent near obstacles. Finally, we validate our approach on a TurtleBot3 robot in the real world, and demonstrate improved obstacle navigation with safety buffers.

SAFE-SMART: Safety Analysis and Formal Evaluation using STL Metrics for Autonomous RoboTs

TL;DR

Abstract

SAFE-SMART: Safety Analysis and Formal Evaluation using STL Metrics for Autonomous RoboTs

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)