Towards Transparent and Efficient Anomaly Detection in Industrial Processes through ExIFFI
Davide Frizzo, Francesco Borsatti, Alessio Arcudi, Antonio De Moliner, Roberto Oboe, Gian Antonio Susto
TL;DR
This work tackles the need for transparent anomaly detection in industrial processes by presenting ExIFFI, a fast, model-specific explanation framework for Extended Isolation Forest (EIF) and its extension EIFp. ExIFFI provides both Global Feature Importance ($GFI$) and Local Feature Importance ($LFI$) by leveraging EIF’s tree structure, with per-node and per-tree contributions aggregated across the forest and normalized to correct biases. On three industrial datasets, ExIFFI delivers high anomaly detection performance (e.g., averaging over 90% AP) while outperforming post-hoc XAI methods in feature-selection proxy tasks, and it achieves substantial speed advantages over alternatives like KernelSHAP. The approach supports real-time, human-centered decision-making in Industry 5.0 by delivering efficient, interpretable explanations that align with domain insight, though it remains tied to EIF-based models and may face challenges with clustered anomalies.
Abstract
Anomaly Detection (AD) is crucial in industrial settings to streamline operations by detecting underlying issues. Conventional methods merely label observations as normal or anomalous, lacking crucial insights. In Industry 5.0, interpretable outcomes become desirable to enable users to understand the rational under model decisions. This paper presents the first industrial application of ExIFFI, a recent approach for fast, efficient explanations for the Extended Isolation Forest (EIF) AD method. ExIFFI is tested on three industrial datasets, demonstrating superior explanation effectiveness, computational efficiency and improved raw anomaly detection performances. ExIFFI reaches over then 90\% of average precision on all the benchmarks considered in the study and overperforms state-of-the-art Explainable Artificial Intelligence (XAI) approaches in terms of the feature selection proxy task metric which was specifically introduced to quantitatively evaluate model explanations.
