FaultExplainer: Leveraging Large Language Models for Interpretable Fault Detection and Diagnosis
Abdullah Khan, Rahul Nahar, Hao Chen, Gonzalo E. Constante Flores, Can Li
TL;DR
FaultExplainer presents an LLM-enabled framework for interpretable fault detection, diagnosis, and explanation in chemical processes by grounding LLM reasoning in PCA-based $T^2$ fault detection and a detailed Tennessee Eastman Process description. The method combines $T^2$ statistics, feature contribution analysis, and two targeted prompts to generate grounded root-cause explanations, evaluated on 15 TEP faults with GPT-4o and o1-preview. Results show plausible, actionable explanations in many cases, but also highlight failures due to PCA feature limitations and natural language model hallucinations, especially for unseen fault scenarios. The work demonstrates a practical, open-source tool for real-time fault monitoring and operator-oriented explanations, with clear avenues for advancing feature selection and domain-specific LLM training to improve reliability and interpretability in industrial settings.
Abstract
Machine learning algorithms are increasingly being applied to fault detection and diagnosis (FDD) in chemical processes. However, existing data-driven FDD platforms often lack interpretability for process operators and struggle to identify root causes of previously unseen faults. This paper presents FaultExplainer, an interactive tool designed to improve fault detection, diagnosis, and explanation in the Tennessee Eastman Process (TEP). FaultExplainer integrates real-time sensor data visualization, Principal Component Analysis (PCA)-based fault detection, and identification of top contributing variables within an interactive user interface powered by large language models (LLMs). We evaluate the LLMs' reasoning capabilities in two scenarios: one where historical root causes are provided, and one where they are not to mimic the challenge of previously unseen faults. Experimental results using GPT-4o and o1-preview models demonstrate the system's strengths in generating plausible and actionable explanations, while also highlighting its limitations, including reliance on PCA-selected features and occasional hallucinations.
