Simultaneous State Estimation and Online Model Learning in a Soft Robotic System
Jan-Hendrik Ewering, Max Bartholdt, Simon F. G. Ehlers, Niklas Wahlström, Thomas B. Schön, Thomas Seel
TL;DR
This work tackles simultaneous state estimation and online model learning for a nonlinear soft robotic system using a gray-box formulation. It couples a nominal constant-curvature model with a state-dependent bending stiffness learned online via a reduced-rank Gaussian Process, all integrated through a marginalized particle filter that samples states while marginalizing GP parameters. The method includes online GP hyperparameter learning and yields a posterior over states, stiffness, and GP coefficients, enabling improved multi-step forward predictions. Experiments on real soft-robot data show state estimates comparable to a UKF baseline and substantial gains in predictive accuracy when incorporating the online-learned bending-stiffness model, highlighting the approach’s potential for adaptive, predictive control in soft robotics.
Abstract
Operating complex real-world systems, such as soft robots, can benefit from precise predictive control schemes that require accurate state and model knowledge. This knowledge is typically not available in practical settings and must be inferred from noisy measurements. In particular, it is challenging to simultaneously estimate unknown states and learn a model online from sequentially arriving measurements. In this paper, we show how a recently proposed gray-box system identification tool enables the estimation of a soft robot's current pose while at the same time learning a bending stiffness model. For estimation and learning, we rely solely on a nominal constant-curvature robot model and measurements of the robot's base reactions (e.g., base forces). The estimation scheme, which relies on a marginalized particle filter, allows us to conveniently interface the nominal constant-curvature equations with a Gaussian Process (GP) bending stiffness model that is learned online. In contrast to estimating stiffness via a random walk over stiffness values, learning a GP model enables prediction of bending stiffness and improves overall model quality. We demonstrate, using real-world soft-robot data, that the method learns a bending stiffness model online while accurately estimating the robot's pose. Notably, reduced multi-step forward-prediction errors indicate that the learned bending-stiffness GP improves overall model quality.
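
To make the idea of learning a GP online from sequentially arriving data more concrete, the following is a minimal sketch and not the implementation used in this work. It assumes a reduced-rank GP built from sine basis functions on an interval [-L, L] with a squared-exponential prior, and it assumes the GP input is known at every step. In a marginalized particle filter, each particle would instead carry its own copy of the weight mean and covariance, conditioned on its sampled state trajectory. All function names, the basis choice, and the hyperparameter values (`m`, `L`, `ell`, `sigma_f`, `noise_var`) are illustrative assumptions; plain Python is used to keep the sketch dependency-free.

```python
import math
import random

def basis(x, m, L):
    """Evaluate m sine basis functions (Laplacian eigenfunctions on [-L, L]) at x."""
    return [math.sin(math.pi * j * (x + L) / (2.0 * L)) / math.sqrt(L)
            for j in range(1, m + 1)]

def prior_variances(m, L, sigma_f=1.0, ell=0.5):
    """Prior weight variances: squared-exponential spectral density at each basis frequency."""
    out = []
    for j in range(1, m + 1):
        omega = math.pi * j / (2.0 * L)
        out.append(sigma_f ** 2 * math.sqrt(2.0 * math.pi) * ell
                   * math.exp(-0.5 * (ell * omega) ** 2))
    return out

def update(mean, cov, phi, y, noise_var):
    """One Kalman-style update of the Gaussian weight posterior for a scalar observation y."""
    m = len(mean)
    cov_phi = [sum(cov[i][k] * phi[k] for k in range(m)) for i in range(m)]
    s = sum(phi[i] * cov_phi[i] for i in range(m)) + noise_var  # innovation variance
    gain = [c / s for c in cov_phi]
    resid = y - sum(phi[i] * mean[i] for i in range(m))
    new_mean = [mean[i] + gain[i] * resid for i in range(m)]
    # cov is symmetric, so phi^T cov equals cov_phi transposed.
    new_cov = [[cov[i][j] - gain[i] * cov_phi[j] for j in range(m)] for i in range(m)]
    return new_mean, new_cov

def learn_online(xs, ys, m=16, L=2.0, noise_var=0.01):
    """Process (input, observation) pairs one at a time, starting from the GP prior."""
    mean = [0.0] * m
    var = prior_variances(m, L)
    cov = [[var[i] if i == j else 0.0 for j in range(m)] for i in range(m)]
    for x, y in zip(xs, ys):
        mean, cov = update(mean, cov, basis(x, m, L), y, noise_var)
    return mean, cov

def predict(x, mean, m=16, L=2.0):
    """Posterior mean of the learned function at input x."""
    return sum(p * w for p, w in zip(basis(x, m, L), mean))
```

Because the unknown function is linear in the basis weights, each update is available in closed form. This conditionally linear-Gaussian structure is what allows the GP coefficients to be marginalized analytically while only the nonlinear states are sampled.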
