Trustworthy and Explainable Deep Reinforcement Learning for Safe and Energy-Efficient Process Control: A Use Case in Industrial Compressed Air Systems
Vincent Bezold, Patrick Wagner, Jakob Hofmann, Marco Huber, Alexander Sauer
TL;DR
The paper tackles energy inefficiency and safety risks in industrial compressed air systems by introducing a trustworthy DRL controller for multi-compressor setups. It combines a PPO-based policy with a physics-informed simulation and a multi-level explainability pipeline (input perturbation, gradient sensitivity, SHAP—including time-resolved attributions) to ensure plausibility and transparency. Empirical results show the agent lowers average pressure, respects safety boundaries, and achieves about 4% energy savings without explicit physics models, with explanations consistently highlighting pressure and forecast inputs as primary drivers. This work provides a transferable framework for interpretable RL in energy-critical industrial processes and lays groundwork for safer real-world deployment.
Abstract
This paper presents a trustworthy reinforcement learning approach for the control of industrial compressed air systems. We develop a framework that enables safe and energy-efficient operation under realistic boundary conditions and introduce a multi-level explainability pipeline combining input perturbation tests, gradient-based sensitivity analysis, and SHAP (SHapley Additive exPlanations) feature attribution. An empirical evaluation across multiple compressor configurations shows that the learned policy is physically plausible, anticipates future demand, and consistently respects system boundaries. Compared to the installed industrial controller, the proposed approach reduces unnecessary overpressure and achieves energy savings of approximately 4\,\% without relying on explicit physics models. The results further indicate that system pressure and forecast information dominate policy decisions, while compressor-level inputs play a secondary role. Overall, the combination of efficiency gains, predictive behavior, and transparent validation supports the trustworthy deployment of reinforcement learning in industrial energy systems.
