Accounts of using the Tustin-Net architecture on a rotary inverted pendulum
Stijn van Esch, Fabio Bonassi, Thomas B. Schön
TL;DR
The paper investigates identifying a rotary inverted pendulum using a physics-informed neural network, the Tustin-Net, and compares it to a first-principles Euler–Lagrange grey-box model on a real Quanser Qube Servo 2. It identifies limitations of standard Tustin-Net training and proposes a transfer-learning workflow—pre-training on transient data, freezing early layers, and fine-tuning final layers—to boost accuracy, especially under imbalanced data. Results show that a TL-trained Tustin-Net achieves RMSEs comparable to or better than the Euler–Lagrange model and with less reliance on detailed physical knowledge, demonstrating practical viability for data-driven mechanical system identification. The approach enhances generalization across operating conditions and offers a scalable path for physics-based deep learning in real hardware settings.
Abstract
In this report we investigate the use of the Tustin neural network architecture (Tustin-Net) for the identification of a physical rotary inverse pendulum. This physics-based architecture is of particular interest as it builds on the known relationship between velocities and positions. We here aim at discussing the advantages, limitations and performance of Tustin-Nets compared to first-principles grey-box models on a real physical apparatus, showing how, with a standard training procedure, the former can hardly achieve the same accuracy as the latter. To address this limitation, we present a training strategy based on transfer learning that yields Tustin-Nets that are competitive with the first-principles model, without requiring extensive knowledge of the setup as the latter.
