Augmenting train maintenance technicians with automated incident diagnostic suggestions
Georges Tod, Jean Bruggeman, Evert Bevernage, Pieter Moelans, Walter Eeckhout, Jean-Luc Glineur
TL;DR
The paper addresses the need for rapid, reliable diagnostic support for train incidents by formulating incident causes as a multiclass classification task: $y=f(x)$ where $x=[x_1,\dots,x_p]$ is a sequence of on-board events and $y$ is the implicated physical subsystem. It proposes a cloud-based platform that ingests incident data, extracts recurrent event sets via feature engineering (including Longest-Common Sub Sequence mining), and uses a cascaded ensemble of Naive Bayes classifiers across time windows to generate near real-time diagnostic suggestions, with a human-in-the-loop feedback loop for continual improvement. Key contributions include (1) the automated diagnostics platform architecture, (2) a two-stage feature engineering pipeline that yields discrete event sets, and (3) a novel discrete set classifier that leverages cascading windows to balance accuracy and coverage. The approach demonstrates competitive predictive performance across multiple fleets, offering actionable explanations through the extracted event sets and enabling faster prioritization of repairs, with promising directions toward predictive maintenance alerts and edge deployment.
Abstract
Train operational incidents are so far diagnosed individually and manually by train maintenance technicians. In order to assist maintenance crews in their responsiveness and task prioritization, a learning machine is developed and deployed in production to suggest diagnostics to train technicians on their phones, tablets or laptops as soon as a train incident is declared. A feedback loop allows to take into account the actual diagnose by designated train maintenance experts to refine the learning machine. By formulating the problem as a discrete set classification task, feature engineering methods are proposed to extract physically plausible sets of events from traces generated on-board railway vehicles. The latter feed an original ensemble classifier to class incidents by their potential technical cause. Finally, the resulting model is trained and validated using real operational data and deployed on a cloud platform. Future work will explore how the extracted sets of events can be used to avoid incidents by assisting human experts in the creation predictive maintenance alerts.
