Learning Arm-Assisted Fall Damage Reduction and Recovery for Legged Mobile Manipulators
Yuntao Ma, Farbod Farshidian, Marco Hutter
TL;DR
The paper tackles fall damage and recovery for legged mobile manipulators by learning an arm-assisted recovery policy. It introduces an asymmetric actor-critic framework with time-varying task rewards within a finite-horizon MDP, enabling a time-invariant policy that uses the arm to reduce impact and aid self-righting. In simulation and on ALMA hardware, the approach reduces base impulse, base acceleration, and peak joint forces during falls, and achieves a high fall-recovery success rate (98.9%) with notable leg-torque savings compared to arm-tugged baselines. The method demonstrates robustness to reward scaling, adaptability to additional tasks (resting, self-righting), and practical sim-to-real transfer, advancing the deployability of legged mobile manipulators with payloads.
Abstract
Adaptive falling and recovery skills greatly extend the applicability of robot deployments. In the case of legged mobile manipulators, the robot arm could adaptively stop the fall and assist the recovery. Prior works on falling and recovery strategies for legged mobile manipulators usually rely on assumptions such as inelastic collisions and falling in defined directions to enable real-time computation. This paper presents a learning-based approach to reducing fall damage and recovery. An asymmetric actor-critic training structure is used to train a time-invariant policy with time-varying reward functions. In simulated experiments, the policy recovers from 98.9\% of initial falling configurations. It reduces base contact impulse, peak joint internal forces, and base acceleration during the fall compared to the baseline methods. The trained control policy is deployed and extensively tested on the ALMA robot hardware. A video summarizing the proposed method and the hardware tests is available at https://youtu.be/avwg2HqGi8s.
