FD-LLM: Large Language Model for Fault Diagnosis of Machines
Hamzah A. A. M. Qaid, Bo Zhang, Dan Li, See-Kiong Ng, Wei Li
TL;DR
FD-LLM introduces a fault-diagnosis-specific large language model framework that treats fault prediction as a multi-class classification problem. It encodes time-series vibration data into text via FFT-based vectors or statistical feature summaries, supplemented by machine specifications and operation conditions in prompts, and fine-tunes open-source LLMs using LoRA. The approach achieves standout performance with Llama3 family on FFT representations, showing strong adaptability across operational conditions but limited cross-component generalization, and demonstrates that prompt-contextual information substantially boosts statistical-data performance. The work highlights the potential of LLMs for integrated textual and numerical diagnostics and suggests future directions in cross-component robustness and reasoning-enabled inference to further enhance reliability in industrial fault diagnosis.
Abstract
Large language models (LLMs) are effective at capturing complex, valuable conceptual representations from textual data for a wide range of real-world applications. However, in fields like Intelligent Fault Diagnosis (IFD), incorporating additional sensor data-such as vibration signals, temperature readings, and operational metrics-is essential but it is challenging to capture such sensor data information within traditional text corpora. This study introduces a novel IFD approach by effectively adapting LLMs to numerical data inputs for identifying various machine faults from time-series sensor data. We propose FD-LLM, an LLM framework specifically designed for fault diagnosis by formulating the training of the LLM as a multi-class classification problem. We explore two methods for encoding vibration signals: the first method uses a string-based tokenization technique to encode vibration signals into text representations, while the second extracts statistical features from both the time and frequency domains as statistical summaries of each signal. We assess the fault diagnosis capabilities of four open-sourced LLMs based on the FD-LLM framework, and evaluate the models' adaptability and generalizability under various operational conditions and machine components, namely for traditional fault diagnosis, cross-operational conditions, and cross-machine component settings. Our results show that LLMs such as Llama3 and Llama3-instruct demonstrate strong fault detection capabilities and significant adaptability across different operational conditions, outperforming state-of-the-art deep learning (DL) approaches in many cases.
