Joint Verification and Refinement of Language Models for Safety-Constrained Planning
Yunhao Yang, Neel P. Bhatt, William Ward, Zichao Hu, Joydeep Biswas, Ufuk Topcu
TL;DR
This work tackles the risk of LLM-generated robot programs violating safety specifications by introducing a formal verification pipeline that translates programs into finite-state automata and checks them against temporal logic safety properties. A key compositional verification theorem guarantees that any composition of individually verified subprograms remains safe, dramatically reducing verification complexity for long-horizon tasks. The authors accompany verification with a verification-guided fine-tuning procedure that uses verified subcomponents as supervision, achieving a 30% rise in specification-compliant programs and a ~50% reduction in training time. Empirical demonstrations on outdoor driving and indoor robots, along with out-of-domain validation and baselines, establish the practicality and efficiency of the approach for safer, scalable real-world robotic programming. The work lays groundwork for multimodal and human-in-the-loop extensions to further enhance safety in dynamic environments.
Abstract
Large language models possess impressive capabilities in generating programs (e.g., Python) from natural language descriptions to execute robotic tasks. However, these generated programs often contain errors that violate externally given task specifications. Without an effective method to verify their correctness, the reliable deployment of language models in real-world systems is practically infeasible. We develop a method that converts generated robot programs into an automaton-based representation and verifies them against task-relevant safety specifications. We establish a theorem that any arbitrary combination of the verified programs will also satisfy the safety specifications. Hence, the method eliminates the need to verify complex programs composed of multiple simpler ones, reducing computation complexity. We then introduce an automated fine-tuning procedure that leverages verification outcomes for supervision. By applying the theorem, this procedure only requires training the model to generate safe sub-components, thereby improving training efficiency. Empirical results on robot applications show a 30 percent increase in the probability of generating specification-compliant programs, with training time reduced by half compared to fine-tuning on generating full programs.
