DPM-Bench: Benchmark for Distributed Process Mining Algorithms on Cyber-Physical Systems
Hendrik Reiter, Patrick Rathje, Olaf Landsiedel, Wilhelm Hasselbring
TL;DR
The paper addresses the limitations of centralized process mining (PM) in Cyber-Physical Systems (CPS) by proposing Distributed Process Mining (DPM) and a formal streaming model adapted to CPS. It introduces DPM-Bench, a benchmark framework that uses Hardware Interaction Instructions (HIIs) to quantify costs and supports topology-aware evaluation across central, decentralized, and distributed PM configurations. The framework is demonstrated on a three-node setup with generated distributed event data. Key contributions are a formal taxonomy of PM topologies, an extended streaming model for CPS, and a public benchmark tool for comparing DPM algorithms and topologies, which provides insight into algorithmic behavior and infrastructure requirements. The framework lets engineers assess hardware and network provisioning for DPM deployments and guides future research toward privacy-preserving, scalable distributed process mining in CPS.
Abstract
Process Mining is established in research and industry systems to analyze and optimize processes based on event data from information systems. In this work, we adapt process mining techniques to Cyber-Physical Systems (CPS). To capture the distributed and heterogeneous characteristics of data, computational resources, and network communication in CPS, today's process mining algorithms and techniques must be augmented. Specifically, there is a need for new Distributed Process Mining algorithms that perform computations directly on edge resources, eliminating the need to move all data to central cloud systems. This paper introduces DPM-Bench, a benchmark for comparing such Distributed Process Mining algorithms. DPM-Bench is used to compare algorithms deployed in different computational topologies. The results enable information system engineers to assess whether the existing infrastructure is sufficient to perform distributed process mining, or to identify required improvements in algorithms and hardware. We present and discuss an experimental evaluation with DPM-Bench.
