CoDefeater: Using LLMs To Find Defeaters in Assurance Cases
Usman Gohar, Michael C. Hunter, Robyn R. Lutz, Myra B. Cohen
TL;DR
Assurance cases for safety-critical systems can be undermined by undetected defeaters. CoDefeater uses LLMs (GPT-3.5) in a zero-shot setting to automatically identify defeaters in two real-world assurance cases, supporting a human-in-the-loop workflow. The study provides empirical evidence that LLMs can identify most ground-truth defeaters and generate novel, feasible defeaters, and it contributes a defeater-rich assurance fragment for further research. These findings suggest a practical path to improve the completeness, soundness, and confidence of assurance cases and accelerate safety certification processes.
Abstract
Constructing assurance cases is a widely used, and sometimes required, process toward demonstrating that safety-critical systems will operate safely in their planned environment. To mitigate the risk of errors and missing edge cases, the concept of defeaters - arguments or evidence that challenge claims in an assurance case - has been introduced. Defeaters can provide timely detection of weaknesses in the arguments, prompting further investigation and timely mitigations. However, capturing defeaters relies on expert judgment, experience, and creativity and must be done iteratively due to evolving requirements and regulations. This paper proposes CoDefeater, an automated process to leverage large language models (LLMs) for finding defeaters. Initial results on two systems show that LLMs can efficiently find known and unforeseen feasible defeaters to support safety analysts in enhancing the completeness and confidence of assurance cases.
