RAAD-LLM: Adaptive Anomaly Detection Using LLMs and RAG Integration
Alicia Russell-Gilbert, Sudip Mittal, Shahram Rahimi, Maria Seale, Joseph Jabour, Thomas Arnold, Joshua Church
TL;DR
RAAD-LLM tackles anomaly detection in predictive maintenance under data-sparse and evolving conditions by combining a frozen LLM with a Retrieval-Augmented Generation pipeline that leverages domain knowledge without dataset-specific fine-tuning. The framework adds an adaptability mechanism to dynamically update the normal baseline and enriches inputs with semantic context, enabling multimodal reasoning with plant operators. Empirical results on a plastics-manufacturing use-case and the SKAB benchmark show substantial accuracy gains over prior AAD-LLM, with RAAD-LLM achieving 88.6% accuracy on the real-world dataset and 71.6% on SKAB, alongside high F1 scores. RAAD-LLMv2 offers scalable retrieval via LlamaIndex, trading some accuracy for improved deployment in real-world, data-sparse environments. Overall, RAAD-LLM has the potential to shift anomaly detection practice in PdM by delivering transferable, interpretable, and context-aware decisions without heavy retraining.
Abstract
Anomaly detection in complex industrial environments poses unique challenges, particularly in contexts characterized by data sparsity and evolving operational conditions. Predictive maintenance (PdM) in such settings demands methodologies that are adaptive, transferable, and capable of integrating domain-specific knowledge. In this paper, we present RAAD-LLM, a novel framework for adaptive anomaly detection, leveraging large language models (LLMs) integrated with Retrieval-Augmented Generation (RAG). This approach addresses the aforementioned PdM challenges. By effectively utilizing domain-specific knowledge, RAAD-LLM enhances the detection of anomalies in time series data without requiring fine-tuning on specific datasets. The framework's adaptability mechanism enables it to adjust its understanding of normal operating conditions dynamically, thus increasing detection accuracy. We validate this methodology through a real-world application for a plastics manufacturing plant and the Skoltech Anomaly Benchmark (SKAB). Results show significant improvements over our previous model with an accuracy increase from 70.7% to 88.6% on the real-world dataset. By allowing for the enriching of input series data with semantics, RAAD-LLM incorporates multimodal capabilities that facilitate more collaborative decision-making between the model and plant operators. Overall, our findings support RAAD-LLM's ability to revolutionize anomaly detection methodologies in PdM, potentially leading to a paradigm shift in how anomaly detection is implemented across various industries.
