Real-Time Machine-Learning-Based Optimization Using Input Convex Long Short-Term Memory Network
Zihao Wang, Donghan Yu, Zhe Wu
TL;DR
The paper tackles the slow, non-convex nature of neural network–based optimization for real-time control by introducing Input Convex LSTM (IC-LSTM), which enforces convexity with respect to inputs through architectural constraints and input expansion. It provides theoretical convexity guarantees and a practical implementation pathway, including a TensorFlow/Keras IC-LSTM cell and a skip-connection design, then validates the approach in two real-world case studies: a Solar PV energy system and a CSTR. Results show IC-LSTM offers substantial runtime speedups (e.g., at least 4x faster than conventional LSTM in the solar PV MPC) while maintaining acceptable closed-loop performance, though some loss in modeling fidelity due to convexification is observed. Collectively, the work demonstrates a viable route to real-time, neural-network–based optimization for energy and chemical processes, with public code to facilitate adoption and further development.
Abstract
Neural network-based optimization and control methods, often referred to as black-box approaches, are increasingly gaining attention in energy and manufacturing systems, particularly in situations where first-principles models are either unavailable or inaccurate. However, their non-convex nature significantly slows down the optimization and control processes, limiting their application in real-time decision-making processes. To address this challenge, we propose a novel Input Convex Long Short-Term Memory (IC-LSTM) network to enhance the computational efficiency of neural network-based optimization. Through two case studies employing real-time neural network-based optimization for optimizing energy and chemical systems, we demonstrate the superior performance of IC-LSTM-based optimization in terms of runtime. Specifically, in a real-time optimization problem of a real-world solar photovoltaic energy system at LHT Holdings in Singapore, IC-LSTM-based optimization achieved at least 4-fold speedup compared to conventional LSTM-based optimization. These results highlight the potential of IC-LSTM networks to significantly enhance the efficiency of neural network-based optimization and control in practical applications. Source code is available at https://github.com/killingbear999/ICLSTM.
