Grounding Generative Planners in Verifiable Logic: A Hybrid Architecture for Trustworthy Embodied AI
Feiyu Wu, Xu Zheng, Yue Qu, Zhuocheng Wang, Zicheng Feng, Hui Li
TL;DR
This paper tackles the safety challenges of planning with stochastic Large Language Models in embodied AI by grounding a generative planner in a formal symbolic framework. It introduces the Verifiable Iterative Refinement Framework (VIRF), composed of a Logic Tutor (OWL 2 DL-based verifier), a rich perception pipeline (VLM-Cascade), and a scalable knowledge-acquisition workflow (Traceable Axiom Synthesis), enabling plan repair via a pedagogical dialogue rather than mere rejection. The approach delivers a perfect $0\%$ Hazardous Action Rate (HAR) and a high Task Efficacy with a $77.3\%$ Goal-Condition Rate (GCR) on SafeAgentBench, while maintaining efficiency through parallel verification and a tutoring loop that teaches the planner. By bridging evaluation blind spots with real-world safety axioms and demonstrating robustness to perception noise and planner scaling, VIRF provides a principled pathway toward trustworthy, verifiable autonomy in embodied agents.
Abstract
Large Language Models (LLMs) show promise as planners for embodied AI, but their stochastic nature lacks formal reasoning, preventing strict safety guarantees for physical deployment. Current approaches often rely on unreliable LLMs for safety checks or simply reject unsafe plans without offering repairs. We introduce the Verifiable Iterative Refinement Framework (VIRF), a neuro-symbolic architecture that shifts the paradigm from passive safety gatekeeping to active collaboration. Our core contribution is a tutor-apprentice dialogue where a deterministic Logic Tutor, grounded in a formal safety ontology, provides causal and pedagogical feedback to an LLM planner. This enables intelligent plan repairs rather than mere avoidance. We also introduce a scalable knowledge acquisition pipeline that synthesizes safety knowledge bases from real-world documents, correcting blind spots in existing benchmarks. In challenging home safety tasks, VIRF achieves a perfect 0 percent Hazardous Action Rate (HAR) and a 77.3 percent Goal-Condition Rate (GCR), which is the highest among all baselines. It is highly efficient, requiring only 1.1 correction iterations on average. VIRF demonstrates a principled pathway toward building fundamentally trustworthy and verifiably safe embodied agents.
