Concept-Guided LLM Agents for Human-AI Safety Codesign
Florian Geissler, Karsten Roscher, Mario Trapp
TL;DR
This work tackles the challenge of using LLMs for safety analysis in software systems by introducing a concept-guided hybrid framework that verbalizes a structured system model into an intermediate representation and guides LLM reasoning through a cascade of micro-decisions. The LLM agent, built with LangChain, interacts with a graph-based system model and external tools (via retrieval-augmented generation and deterministic functions) to perform fault propagation, critical-path, single-point-of-failure, and replication analyses. A key contribution is the graph verbalization and a predefined-concept decision network that constrains tool triggers, improving safety and reliability over naïve chain-of-thought (CoT) approaches. Demonstrated on a simplified automated driving architecture, the method yields accurate task identification and safe graph updates, offering a scalable, interactive Human-AI safety codesign framework with potential applicability to larger safety-critical domains.
Abstract
Generative AI is increasingly important in software engineering, including safety engineering, whose purpose is to ensure that software does not cause harm to people. This in turn places high quality requirements on generative AI itself, which the simplistic use of Large Language Models (LLMs) alone will not meet. More advanced approaches are needed that can effectively address the complexities and safety concerns of software systems. Ultimately, humans must understand and take responsibility for the suggestions provided by generative AI to ensure system safety. To this end, we present an efficient, hybrid strategy to leverage LLMs for safety analysis and Human-AI codesign. In particular, we develop a customized LLM agent that uses elements of prompt engineering, heuristic reasoning, and retrieval-augmented generation to solve tasks associated with predefined safety concepts, in interaction with a system model graph. The reasoning is guided by a cascade of micro-decisions that help preserve structured information. We further suggest a graph verbalization that acts as an intermediate representation of the system model to facilitate LLM-graph interactions. Selected pairs of prompts and responses relevant for safety analytics illustrate our method for the use case of a simplified automated driving system.
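
To make the idea of a graph verbalization and a deterministic analysis tool more concrete, the following is a minimal sketch in plain Python. It is not the authors' implementation: the toy component names, the sentence template in `verbalize`, and the helpers `reachable` and `single_points_of_failure` are all hypothetical and chosen only for illustration. The sketch assumes the system model is a directed graph in which an edge points from a component to the consumers of its output.

```python
from collections import deque

# Hypothetical toy system model (illustrative names, not from the paper):
# each key sends its output to the components listed as its value.
EDGES = {
    "camera": ["perception"],
    "lidar": ["perception"],
    "perception": ["planner"],
    "planner": ["actuator"],
    "actuator": [],
}


def verbalize(edges):
    """Render the graph as plain sentences that a prompt could include."""
    lines = []
    for node in sorted(edges):
        targets = edges[node]
        if targets:
            lines.append(f"{node} sends output to {', '.join(targets)}.")
        else:
            lines.append(f"{node} has no outgoing connections.")
    return "\n".join(lines)


def reachable(edges, start, removed=None):
    """Breadth-first search from start, skipping the removed node if given.

    With removed=None this doubles as a simple fault-propagation query:
    it returns every component a fault at start could reach.
    """
    if start == removed:
        return set()
    seen = {start}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in edges[node]:
            if nxt != removed and nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def single_points_of_failure(edges, sources, sink):
    """Return intermediate nodes whose removal cuts every source off from the sink."""
    candidates = [n for n in edges if n not in sources and n != sink]
    spofs = []
    for node in candidates:
        if not any(sink in reachable(edges, s, removed=node) for s in sources):
            spofs.append(node)
    return sorted(spofs)
```

In a setup like the one described, the text produced by `verbalize(EDGES)` would be what the LLM reads, while a function such as `single_points_of_failure(EDGES, ["camera", "lidar"], "actuator")` would be a deterministic tool the agent triggers instead of reasoning about graph connectivity in free text.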
