Physics-Informed Neural Networks with Hard Linear Equality Constraints
Hao Chen, Gonzalo E. Constante Flores, Can Li
TL;DR
The paper introduces KKT-hPINN, a physics-informed neural network that guarantees hard linear equality constraints by embedding two non-trainable projection layers derived from Karush-Kuhn-Tucker conditions. This approach solves a small quadratic program to project predictions onto the feasible constraint set, ensuring exact feasibility during both training and inference without extra hyperparameters or post-processing. Across three Aspen Plus–based case studies (CSTR unit, DME-DEE plant, extractive distillation subsystem), KKT-hPINN consistently achieves lower RMSE and near-zero constraint violations compared to unconstrained neural networks and soft-constraint PINNs, demonstrating improved accuracy and robustness, even with reduced training data. The work highlights the method’s applicability as a high-fidelity, physically consistent surrogate modeling tool for process systems engineering, with potential to mitigate error propagation from constraint violations in large-scale integrations.
Abstract
Surrogate modeling is used to replace computationally expensive simulations. Neural networks have been widely applied as surrogate models that enable efficient evaluations over complex physical systems. Despite this, neural networks are data-driven models and devoid of any physics. The incorporation of physics into neural networks can improve generalization and data efficiency. The physics-informed neural network (PINN) is an approach to leverage known physical constraints present in the data, but it cannot strictly satisfy them in the predictions. This work proposes a novel physics-informed neural network, KKT-hPINN, which rigorously guarantees hard linear equality constraints through projection layers derived from KKT conditions. Numerical experiments on Aspen models of a continuous stirred-tank reactor (CSTR) unit, an extractive distillation subsystem, and a chemical plant demonstrate that this model can further enhance the prediction accuracy.
