Run-Time Monitoring of ERTMS/ETCS Control Flow by Process Mining
Francesco Vitale, Tommaso Zoppi, Francesco Flammini, Nicola Mazzocca
TL;DR
The paper tackles the challenge of ensuring resilience in ERTMS/ETCS L2 by enabling run-time monitoring of control flow through process mining. It proposes a three-stage methodology that first learns a normative execution model via offline process discovery (Petri nets), then performs online conformance checking to generate diagnoses, and finally applies unsupervised clustering and explanation to localize anomalies to specific components. The RBC/RBC Handover scenario serves as a proof-of-concept, demonstrating that online conformance diagnoses can be effectively clustered and explained, achieving high accuracy and clear localization under appropriate window sizes. The work contributes to safer, more explainable railway software by bridging offline verification with online assurance and outlines future work on adaptive mitigation and digital twin integration. Overall, the approach shows promise for improving dependability in safety-critical rail systems through explainable, run-time anomaly detection and localization.
Abstract
Ensuring the resilience of computer-based railways is increasingly crucial to account for uncertainties and changes due to the growing complexity and criticality of those systems. Although their software relies on strict verification and validation processes following well-established best-practices and certification standards, anomalies can still occur at run-time due to residual faults, system and environmental modifications that were unknown at design-time, or other emergent cyber-threat scenarios. This paper explores run-time control-flow anomaly detection using process mining to enhance the resilience of ERTMS/ETCS L2 (European Rail Traffic Management System / European Train Control System Level 2). Process mining allows learning the actual control flow of the system from its execution traces, thus enabling run-time monitoring through online conformance checking. In addition, anomaly localization is performed through unsupervised machine learning to link relevant deviations to critical system components. We test our approach on a reference ERTMS/ETCS L2 scenario, namely the RBC/RBC Handover, to show its capability to detect and localize anomalies with high accuracy, efficiency, and explainability.
