Interpretable Rules for Online Failure Prediction: A Case Study on the Metro do Porto dataset
Matthias Jakobs, Bruno Veloso, Joao Gama
TL;DR
This work tackles interpretable online failure prediction for Metro do Porto trains by coupling a Convolutional Autoencoder-based failure detector with an online rule-learning pipeline that derives both local and global, easily interpretable rules. Time-windowed sensor data are transformed into concise features (e.g., variances, min/max/mean) and used to train decision trees that explain detected failures, with a strong emphasis on a binary failure probability $p_t(\text{failure})$ and smoothing via $\alpha$ and $\tau_{\text{fail}}$. The key finding is that a single sensor, Flowmeter, is highly predictive for both failures, yielding simple thresholds that trigger alarms well before the LPS signal, and alternative cheaper sensors can still produce valid explanations when Flowmeter is unavailable. The study demonstrates that MetroPT2 is not extremely challenging when Flowmeter is present, but highlights the need to test on MetroPT3 for more robust, real-world applicability, and discusses limitations such as infinite possible rules and unbounded historical data.
Abstract
Due to their high predictive performance, predictive maintenance applications have increasingly been approached with Deep Learning techniques in recent years. However, as in other real-world application scenarios, the need for explainability is often stated but not sufficiently addressed. This study will focus on predicting failures on Metro trains in Porto, Portugal. While recent works have found high-performing deep neural network architectures that feature a parallel explainability pipeline, the generated explanations are fairly complicated and need help explaining why the failures are happening. This work proposes a simple online rule-based explainability approach with interpretable features that leads to straightforward, interpretable rules. We showcase our approach on MetroPT2 and find that three specific sensors on the Metro do Porto trains suffice to predict the failures present in the dataset with simple rules.
