Dynamic Hybrid Modeling: Incremental Identification and Model Predictive Control
Adrian Caspari, Thomas Bierweiler, Sarah Fadda, Daniel Labisch, Maarten Nauta, Franzisko Wagner, Merle Warmbold, Constantinos C. Pantelides
TL;DR
This work tackles the identification of dynamic hybrid models that couple mechanistic DAEs with data-driven corrections by proposing an incremental identification framework. The method decouples the process into four steps: regularized dynamic parameter estimation to obtain time profiles $p(t)$, correlation analysis to select relevant inputs, data-driven model identification to map selected inputs to $p(t)$ via neural networks, and hybrid model integration that replaces $p(t)$ with the learned $ML(\cdot)$ within the DAE. The authors demonstrate robustness and efficiency through three case studies—a chemical reactor, a bioreactor, and a real research plant integrated with MPC—showing the approach can operate under limited data and complex mechanistic cores. The work provides a practical, diagnosable pathway to building accurate, deployable hybrid models with MPC-ready capabilities, while highlighting the importance of regularization and careful input selection for reliable data-driven components.
Abstract
Mathematical models are crucial for optimizing and controlling chemical processes, yet they often face significant limitations in terms of computational time, algorithm complexity, and development costs. Hybrid models, which combine mechanistic models with data-driven models (i.e. models derived via the application of machine learning to experimental data), have emerged as a promising solution to these challenges. However, the identification of dynamic hybrid models remains difficult due to the need to integrate data-driven models within mechanistic model structures. We present an incremental identification approach for dynamic hybrid models that decouples the mechanistic and data-driven components to overcome computational and conceptual difficulties. Our methodology comprises four key steps: (1) regularized dynamic parameter estimation to determine optimal time profiles for flux variables, (2) correlation analysis to evaluate relationships between variables, (3) data-driven model identification using advanced machine learning techniques, and (4) hybrid model integration to combine the mechanistic and data-driven components. This approach facilitates early evaluation of model structure suitability, accelerates the development of hybrid models, and allows for independent identification of data-driven components. Three case studies are presented to illustrate the robustness, reliability, and efficiency of our incremental approach in handling complex systems and scenarios with limited data.
