Ensemble-Enhanced Graph Autoencoder with GAT and Transformer-Based Encoders for Robust Fault Diagnosis
Moirangthem Tiken Singh
TL;DR
The paper addresses robust fault diagnosis from vibration signals under varying operating conditions by proposing a graph-based framework that converts time-series data into a segment graph via entropy-based windowing and DTW-based edges. A Graph Autoencoder with a deep graph transformer encoder/decoder learns latent graph representations, augmented by an ensemble classifier to predict fault types. Across CWRU and HUST datasets, the model achieves near-perfect cross-domain performance (mean F1 around $0.99$ on CWRU) and demonstrates that richer, more diverse training data (3 HP) yields the strongest generalization. The approach highlights the importance of dataset diversity and provides a scalable, robust fault-diagnosis solution for industrial settings.
Abstract
Fault classification in industrial machinery is vital for enhancing reliability and reducing downtime, yet it remains challenging due to the variability of vibration patterns across diverse operating conditions. This study introduces a novel graph-based framework for fault classification, converting time-series vibration data from machinery operating at varying horsepower levels into a graph representation. We utilize Shannon's entropy to determine the optimal window size for data segmentation, ensuring each segment captures significant temporal patterns, and employ Dynamic Time Warping (DTW) to define graph edges based on segment similarity. A Graph Auto Encoder (GAE) with a deep graph transformer encoder, decoder, and ensemble classifier is developed to learn latent graph representations and classify faults across various categories. The GAE's performance is evaluated on the Case Western Reserve University (CWRU) dataset, with cross-dataset generalization assessed on the HUST dataset. Results show that GAE achieves a mean F1-score of 0.99 on the CWRU dataset, significantly outperforming baseline models-CNN, LSTM, RNN, GRU, and Bi-LSTM (F1-scores: 0.94-0.97, p < 0.05, Wilcoxon signed-rank test for Bi-LSTM: p < 0.05) -- particularly in challenging classes (e.g., Class 8: 0.99 vs. 0.71 for Bi-LSTM). Visualization of dataset characteristics reveals that datasets with amplified vibration patterns and diverse fault dynamics enhance generalization. This framework provides a robust solution for fault diagnosis under varying conditions, offering insights into dataset impacts on model performance.
