Hybrid Modeling, Sim-to-Real Reinforcement Learning, and Large Language Model Driven Control for Digital Twins
Adil Rasheed, Oscar Ravik, Omer San
TL;DR
This work evaluates a digital twin pipeline for a miniature greenhouse by comparing physics-based, data-driven, and hybrid HAM predictors across interpolation and extrapolation, and by testing Model Predictive Control, Reinforcement Learning, and Large Language Model-based controllers. HAM emerges as the best overall predictor in balance between accuracy, generalization, and computation, while LSTM achieves the highest precision at greater resource cost. Among controllers, MPC offers robust, predictable performance; RL demonstrates strong adaptability; LLM-based controllers provide flexible human–AI interaction when integrated with predictive surrogates. The study demonstrates sim-to-real transfer and discusses the practical implications of integrating LLMs for interpretable, tool-assisted control in digital twins, outlining directions for open-source deployment and broader applicability.
Abstract
This work investigates the use of digital twins for dynamical system modeling and control, integrating physics-based, data-driven, and hybrid approaches with both traditional and AI-driven controllers. Using a miniature greenhouse as a test platform, four predictive models Linear, Physics-Based Modeling (PBM), Long Short Term Memory (LSTM), and Hybrid Analysis and Modeling (HAM) are developed and compared under interpolation and extrapolation scenarios. Three control strategies Model Predictive Control (MPC), Reinforcement Learning (RL), and Large Language Model (LLM) based control are also implemented to assess trade-offs in precision, adaptability, and implementation effort. Results show that in modeling HAM provides the most balanced performance across accuracy, generalization, and computational efficiency, while LSTM achieves high precision at greater resource cost. Among controllers, MPC delivers robust and predictable performance, RL demonstrates strong adaptability, and LLM-based controllers offer flexible human-AI interaction when coupled with predictive tools.
