Guiding Energy-Efficient Locomotion through Impact Mitigation Rewards
Chenghao Wang, Arjun Viswanathan, Eric Sihite, Alireza Ramezani
TL;DR
The paper addresses the gap between imitation-based gaits and the passive, energy-dissipative dynamics underlying efficient locomotion. It introduces the Impact Mitigation Factor (IMF) as a physics-informed reward that quantifies passive impact absorption and combines it with Adversarial Motion Priors (AMP) or handcrafted rewards to guide learning. Through simulation on Husky-v2, IMF-augmented policies achieve up to $32\%$ reductions in Cost of Transport ($CoT$) while also lowering peak mechanical power and joint torques across flat and rough terrains. This hybrid framework demonstrates that leveraging IMF enables policies to learn both explicit motion styles and the implicit dynamic principles of natural locomotion, with strong implications for energy-efficient, robust legged robots and future hardware validation.
Abstract
Animals achieve energy-efficient locomotion by their implicit passive dynamics, a marvel that has captivated roboticists for decades.Recently, methods incorporated Adversarial Motion Prior (AMP) and Reinforcement learning (RL) shows promising progress to replicate Animals' naturalistic motion. However, such imitation learning approaches predominantly capture explicit kinematic patterns, so-called gaits, while overlooking the implicit passive dynamics. This work bridges this gap by incorporating a reward term guided by Impact Mitigation Factor (IMF), a physics-informed metric that quantifies a robot's ability to passively mitigate impacts. By integrating IMF with AMP, our approach enables RL policies to learn both explicit motion trajectories from animal reference motion and the implicit passive dynamic. We demonstrate energy efficiency improvements of up to 32%, as measured by the Cost of Transport (CoT), across both AMP and handcrafted reward structure.
