Learning Stack-of-Tasks Management for Redundant Robots
Alessandro Adami, Aris Synodinos, Matteo Iovino, Ruggero Carli, Pietro Falco
TL;DR
This work tackles redundancy management in high-DOF robots by automatically learning complete Stack-of-Tasks structures directly from a user-defined cost. It introduces Genetic Programming guided by sim-to-real episodic evaluation to optimize the discrete task ordering, activation flags, and continuous gains, yielding interpretable, expert-like hierarchies without manual tuning. The approach demonstrates robust sim-to-real transfer on a dual-arm mobile robot (mobile-YuMi), with reliable obstacle avoidance and high tracking accuracy in static and dynamic environments, even in the presence of distractors. Overall, the framework offers a scalable, user-driven alternative to hand-crafted SoT design, capable of adapting task hierarchies to multi-objective trade-offs while maintaining safety and performance.
Abstract
This paper presents a novel framework for automatically learning complete Stack-of-Tasks (SoT) controllers for redundant robotic systems, including task priorities, activation logic, and control parameters. Unlike classical SoT pipelines-where task hierarchies are manually defined and tuned-our approach optimizes the full SoT structure directly from a user-specified cost function encoding intuitive preferences such as safety, precision, manipulability, or execution speed. The method combines Genetic Programming with simulation-based evaluation to explore both discrete (priority order, task activation) and continuous (gains, trajectory durations) components of the controller. We validate the framework on a dual-arm mobile manipulator (the ABB mobile-YuMi research platform), demonstrating robust convergence across multiple cost definitions, automatic suppression of irrelevant tasks, and strong resilience to distractors. Learned SoTs exhibit expert-like hierarchical structure and adapt naturally to multi-objective trade-offs. Crucially, all controllers transfer from Gazebo simulation to the real robot, achieving safe and precise motion without additional tuning. Experiments in static and dynamic environments show reliable obstacle avoidance, high tracking accuracy, and predictable behavior in the presence of humans. The proposed method provides an interpretable and scalable alternative to manual SoT design, enabling rapid, user-driven generation of task execution hierarchies for complex robotic systems.
