Interpretable and Explainable Machine Learning Methods for Predictive Process Monitoring: A Systematic Literature Review
Nijat Mehdiyev, Maxim Majlatow, Peter Fettke
TL;DR
This systematic literature review addresses the challenge of understanding and applying explainable and interpretable ML in predictive process monitoring. Using PRISMA, it synthesizes 67 studies to distinguish intrinsically interpretable approaches from post-hoc explanations, maps application domains and datasets (notably BPIC), and analyzes evaluation practices. Key findings include a dominance of interpretability in some methods (e.g., decision trees) and a heavy reliance on post-hoc XAI for black-box models, with gaps in formal evaluation and real-world deployment. The work provides a structured foundation for practitioners and researchers to design transparent predictive process analytics and to establish more rigorous, multidimensional evaluation frameworks.
Abstract
This paper presents a systematic literature review (SLR) on the explainability and interpretability of machine learning (ML) models within the context of predictive process mining, using the PRISMA framework. Given the rapid advancement of artificial intelligence (AI) and ML systems, understanding the "black-box" nature of these technologies has become increasingly critical. Focusing specifically on the domain of process mining, this paper delves into the challenges of interpreting ML models trained with complex business process data. We differentiate between intrinsically interpretable models and those that require post-hoc explanation techniques, providing a comprehensive overview of the current methodologies and their applications across various application domains. Through a rigorous bibliographic analysis, this research offers a detailed synthesis of the state of explainability and interpretability in predictive process mining, identifying key trends, challenges, and future directions. Our findings aim to equip researchers and practitioners with a deeper understanding of how to develop and implement more trustworthy, transparent, and effective intelligent systems for predictive process analytics.
