ATRos: Learning Energy-Efficient Agile Locomotion for Wheeled-legged Robots
Jingyuan Sun, Hongyu Ji, Zihan Qu, Chaoran Wang, Mingyu Zhang
TL;DR
The paper tackles the challenge of energy-efficient, agile locomotion for wheeled-legged robots by introducing ATRos, an RL-based framework that combines a Predictive Response Policy Network (PRPN) with a PPO-based actor–critic to coordinate wheels and legs using proprioceptive inputs. It formulates the problem as a partially observable Markov decision process and employs an energy-aware reward to encourage efficient, stable, terrain-adaptive motion, achieving zero-shot sim-to-real transfer. The authors validate ATRos in simulation and on a real Go2W platform across grass and stair terrains, demonstrating improved energy efficiency and robust, agile performance with diverse terrain generalization. The work advances practical hybrid locomotion by balancing rolling efficiency and legged versatility, with implications for long-duration missions in real-world environments.
Abstract
Hybrid locomotion of wheeled-legged robots has recently attracted increasing attention due to their advantages of combining the agility of legged locomotion and the efficiency of wheeled motion. But along with expanded performance, the whole-body control of wheeled-legged robots remains challenging for hybrid locomotion. In this paper, we present ATRos, a reinforcement learning (RL)-based hybrid locomotion framework to achieve hybrid walking-driving motions on the wheeled-legged robot. Without giving predefined gait patterns, our planner aims to intelligently coordinate simultaneous wheel and leg movements, thereby achieving improved terrain adaptability and improved energy efficiency. Based on RL techniques, our approach constructs a prediction policy network that could estimate external environmental states from proprioceptive sensory information, and the outputs are then fed into an actor critic network to produce optimal joint commands. The feasibility of the proposed framework is validated through both simulations and real-world experiments across diverse terrains, including flat ground, stairs, and grassy surfaces. The hybrid locomotion framework shows robust performance over various unseen terrains, highlighting its generalization capability.
