A Soft Robotic System Automatically Learns Precise Agile Motions Without Model Information
Simon Bachhuber, Alexander Pawluchin, Arka Pal, Ivo Boblan, Thomas Seel
TL;DR
The paper tackles the challenge of precisely controlling Pneumatic Soft Actuators (PSAs) despite nonlinearities and hysteresis. It introduces Automatic Neural ODE Control (ANODEC), a two-stage data-driven approach that first learns a neural ODE model of the plant from input-output data and then synthesizes a neural ODE feedback controller, both optimized via RK4 integration and Adam optimization. On a real PSA with hysteresis, ANODEC achieves agile, non-repetitive reference tracking using only 30 s of interaction data and outperforms a manually tuned PID baseline across multiple unseen reference signals and a second, gravity-affected setup. This demonstrates data-efficient, model-free automatic control design for practical soft robotics, enabling robust agile motions with minimal experimental interaction time. Future work includes extending to multi-DOF PSAs and automating hyperparameter tuning and data acquisition.
Abstract
Many application domains, e.g., in medicine and manufacturing, can greatly benefit from pneumatic Soft Robots (SRs). However, the accurate control of SRs has remained a significant challenge to date, mainly due to their nonlinear dynamics and viscoelastic material properties. Conventional control design methods often rely on either complex system modeling or time-intensive manual tuning, both of which require significant amounts of human expertise and thus limit their practicality. In recent works, the data-driven method, Automatic Neural ODE Control (ANODEC) has been successfully used to -- fully automatically and utilizing only input-output data -- design controllers for various nonlinear systems in silico, and without requiring prior model knowledge or extensive manual tuning. In this work, we successfully apply ANODEC to automatically learn to perform agile, non-repetitive reference tracking motion tasks in a real-world SR and within a finite time horizon. To the best of the authors' knowledge, ANODEC achieves, for the first time, performant control of a SR with hysteresis effects from only 30 seconds of input-output data and without any prior model knowledge. We show that for multiple, qualitatively different and even out-of-training-distribution reference signals, a single feedback controller designed by ANODEC outperforms a manually tuned PID baseline consistently. Overall, this contribution not only further strengthens the validity of ANODEC, but it marks an important step towards more practical, easy-to-use SRs that can automatically learn to perform agile motions from minimal experimental interaction time.
