On the Role of the Action Space in Robot Manipulation Learning and Sim-to-Real Transfer
Elie Aljalbout, Felix Frank, Maximilian Karl, Patrick van der Smagt
TL;DR
This work investigates how action-space design influences reinforcement learning for robot manipulation and sim-to-real transfer. It evaluates 13 action spaces—ranging from joint torque and impedance-based configurations to Cartesian, velocity-based, and delta-action variants—on two manipulation tasks (reaching and pushing) with a Franka Panda in simulation and on a real robot, using PPO for policy optimization. The study introduces metrics to quantify training efficiency, constraint violations, tracking feasibility, task accuracy, and the sim-to-real gap (OTE) and identifies that velocity-based and high-order-derivative action spaces generally transfer better, with delta-action spaces offering robustness. The findings provide concrete guidelines for action-space design to improve real-world deployment of RL policies in robotics.
Abstract
We study the choice of action space in robot manipulation learning and sim-to-real transfer. We define metrics that assess the performance, and examine the emerging properties in the different action spaces. We train over 250 reinforcement learning~(RL) agents in simulated reaching and pushing tasks, using 13 different control spaces. The choice of spaces spans combinations of common action space design characteristics. We evaluate the training performance in simulation and the transfer to a real-world environment. We identify good and bad characteristics of robotic action spaces and make recommendations for future designs. Our findings have important implications for the design of RL algorithms for robot manipulation tasks, and highlight the need for careful consideration of action spaces when training and transferring RL agents for real-world robotics.
