Are GNNs Actually Effective for Multimodal Fault Diagnosis in Microservice Systems?
Fei Gao, Ruyue Xin, Xiaocui Li, Yaqiang Zhang
TL;DR
The paper tackles whether graph neural networks are truly beneficial for fault diagnosis in microservice systems by introducing DiagMLP, a topology-agnostic baseline that preserves multimodal fusion while removing explicit graph modeling. Through ablations across five datasets, DiagMLP matches or surpasses state-of-the-art GNN-based methods on fault detection, localization, and classification, suggesting that preprocessing and embedding pipelines already encode critical dependency information. The authors visualize embeddings to show that topology information adds little discriminative power beyond the rich multimodal features, challenging the assumed value of graph structures. They advocate rigorous baselines, standardized preprocessing, and larger, more explicit datasets to accurately evaluate architectural innovations in fault-diagnosis research.
Abstract
Graph Neural Networks (GNNs) are widely adopted for fault diagnosis in microservice systems, premised on their ability to model service dependencies. However, the necessity of explicit graph structures remains underexamined, as existing evaluations conflate preprocessing with architectural contributions. To isolate the true value of GNNs, we propose DiagMLP, a deliberately minimal, topology-agnostic baseline that retains multimodal fusion capabilities while excluding graph modeling. Through ablation experiments across five datasets, DiagMLP achieves performance parity with state-of-the-art GNN-based methods in fault detection, localization, and classification. These findings challenge the prevailing assumption that graph structures are indispensable, revealing that: (i) preprocessing pipelines already encode critical dependency information, and (ii) GNN modules contribute marginally beyond multimodality fusion. Our work advocates for systematic re-evaluation of architectural complexity and highlights the need for standardized baseline protocols to validate model innovations.
