Bio-Inspired Plastic Neural Networks for Zero-Shot Out-of-Distribution Generalization in Complex Animal-Inspired Robots
Binggwong Leung, Worasuchad Haomachai, Joachim Winther Pedersen, Sebastian Risi, Poramate Manoonpong
TL;DR
The paper tackles the brittleness of deep policy controllers in robotics under out-of-distribution (OOD) conditions and the sim-to-real gap by introducing a bio-inspired neural network with Hebbian plasticity and a weight-normalization mechanism. Trained with evolution strategies, the model is evaluated on complex 18-$DOF$ dung beetle-like and 16-$DOF$ gecko-like robots, demonstrating zero-shot sim-to-real locomotion and robust generalization to uneven terrain and morphological damage. A key contribution is the incorporation of two normalization schemes for plastic weight updates and the use of PCA to reveal the dynamic weight attractors that underlie adaptive behavior. The results show that Hebbian plasticity can yield robust, adaptable locomotion in real-world, high-DOF legged robots without terrain randomization, offering a promising alternative to more data-intensive domain randomization approaches.
Abstract
Artificial neural networks can be used to solve a variety of robotic tasks. However, they risk failing catastrophically when faced with out-of-distribution (OOD) situations. Several approaches have employed a type of synaptic plasticity known as Hebbian learning that can dynamically adjust weights based on local neural activities. Research has shown that synaptic plasticity can make policies more robust and help them adapt to unforeseen changes in the environment. However, networks augmented with Hebbian learning can lead to weight divergence, resulting in network instability. Furthermore, such Hebbian networks have not yet been applied to solve legged locomotion in complex real robots with many degrees of freedom. In this work, we improve the Hebbian network with a weight normalization mechanism for preventing weight divergence, analyze the principal components of the Hebbian's weights, and perform a thorough evaluation of network performance in locomotion control for real 18-DOF dung beetle-like and 16-DOF gecko-like robots. We find that the Hebbian-based plastic network can execute zero-shot sim-to-real adaptation locomotion and generalize to unseen conditions, such as uneven terrain and morphological damage.
