Locomotion Generation for a Rat Robot based on Environmental Changes via Reinforcement Learning
Xinhui Shan, Yuhong Huang, Zhenshan Bing, Zitao Zhang, Xiangtong Yao, Kai Huang, Alois Knoll
TL;DR
This paper tackles locomotion generation for a small rat-like quadruped (NeRmo) with limited sensor feedback by leveraging reinforcement learning augmented with frequency-domain perception. It extracts robust environmental cues from IMU data using FFT, filters to the gait-relevant band, and represents environmental changes with a simplified two-sine model, reducing noise and computation. A multifunctional reward framework ties fall penalties and axis-specific desirables to guide adaptive gaits across ramps, stairs, and spiral stairs, learned via PPO without pre-training. Experiments show rapid convergence (≈0.25M steps) and high success rates across several environments, highlighting the approach's potential for robust, sensor-efficient locomotion in small robots.
Abstract
This research focuses on developing reinforcement learning approaches for the locomotion generation of small-size quadruped robots. The rat robot NeRmo is employed as the experimental platform. Due to the constrained volume, small-size quadruped robots typically possess fewer and weaker sensors, resulting in difficulty in accurately perceiving and responding to environmental changes. In this context, insufficient and imprecise feedback data from sensors makes it difficult to generate adaptive locomotion based on reinforcement learning. To overcome these challenges, this paper proposes a novel reinforcement learning approach that focuses on extracting effective perceptual information to enhance the environmental adaptability of small-size quadruped robots. According to the frequency of a robot's gait stride, key information of sensor data is analyzed utilizing sinusoidal functions derived from Fourier transform results. Additionally, a multifunctional reward mechanism is proposed to generate adaptive locomotion in different tasks. Extensive simulations are conducted to assess the effectiveness of the proposed reinforcement learning approach in generating rat robot locomotion in various environments. The experiment results illustrate the capability of the proposed approach to maintain stable locomotion of a rat robot across different terrains, including ramps, stairs, and spiral stairs.
