Interpretable Prognostics with Concept Bottleneck Models
Florent Forest, Katharina Rombach, Olga Fink
TL;DR
This work tackles the interpretability gap in prognostics by introducing Concept Bottleneck Models (CBMs) that predict RUL through high-level degradation-mode concepts. By using component degradation states as intermediate concepts, the authors explore CBMs, their extensions (CEMs, Hybrid CBMs), and test-time interventions, demonstrating competitive or superior performance on the N-CMAPSS turbofan dataset compared with non-interpretable baselines. Key findings show that CEMs achieve top RUL accuracy and concept fidelity, while test-time interventions can further improve predictions with minimal degradation of interpretability. The study highlights the practical value of interpretable prognostics for safety-critical assets and points to robust performance even with incomplete concept labeling, albeit with limitations around concept labeling requirements and deployment considerations.
Abstract
Deep learning approaches have recently been extensively explored for the prognostics of industrial assets. However, they still suffer from a lack of interpretability, which hinders their adoption in safety-critical applications. To improve their trustworthiness, explainable AI (XAI) techniques have been applied in prognostics, primarily to quantify the importance of input variables for predicting the remaining useful life (RUL) using post-hoc attribution methods. In this work, we propose the application of Concept Bottleneck Models (CBMs), a family of inherently interpretable neural network architectures based on concept explanations, to the task of RUL prediction. Unlike attribution methods, which explain decisions in terms of low-level input features, concepts represent high-level information that is easily understandable by users. Moreover, once verified in actual applications, CBMs enable domain experts to intervene on the concept activations at test-time. We propose using the different degradation modes of an asset as intermediate concepts. Our case studies on the New Commercial Modular AeroPropulsion System Simulation (N-CMAPSS) aircraft engine dataset for RUL prediction demonstrate that the performance of CBMs can be on par or superior to black-box models, while being more interpretable, even when the available labeled concepts are limited. Code available at \href{https://github.com/EPFL-IMOS/concept-prognostics/}{\url{github.com/EPFL-IMOS/concept-prognostics/}}.
