SATA: Safe and Adaptive Torque-Based Locomotion Policies Inspired by Animal Learning

Peizhuo Li; Hongyi Li; Ge Sun; Jin Cheng; Xinrong Yang; Guillaume Bellegarda; Milad Shafiee; Yuhong Cao; Auke Ijspeert; Guillaume Sartoretti

TL;DR

SATA tackles safety concerns in legged locomotion by delivering torque-based policies that directly drive actuators. It combines a biomechanical model with a growth-based training schedule to improve exploration, stability, and zero-shot sim-to-real transfer, enabling compliant interaction with humans and deformable terrains. The approach yields high compliance, robustness to disturbances, and reliable deployment without fine-tuning, outperforming baselines across challenging scenarios. This work demonstrates that physics-aware, growth-driven torque control can surpass traditional position-based methods in safety-critical, real-world contexts.

Abstract

Despite recent advances in learning-based controllers for legged robots, deployments in human-centric environments remain limited by safety concerns. Most of these approaches use position-based control, where policies output target joint angles that must be processed by a low-level controller (e.g., PD or impedance controllers) to compute joint torques. Although impressive results have been achieved in controlled real-world scenarios, these methods often struggle with compliance and adaptability when encountering environments or disturbances unseen during training, potentially resulting in extreme or unsafe behaviors. Inspired by how animals achieve smooth and adaptive movements by controlling muscle extension and contraction, torque-based policies offer a promising alternative by enabling precise and direct control of the actuators in torque space. In principle, this approach facilitates more effective interactions with the environment, resulting in safer and more adaptable behaviors. However, challenges such as a highly nonlinear state space and inefficient exploration during training have hindered their broader adoption. To address these limitations, we propose SATA, a bio-inspired framework that mimics key biomechanical principles and adaptive learning mechanisms observed in animal locomotion. Our approach effectively addresses the inherent challenges of learning torque-based policies by significantly improving early-stage exploration, leading to high-performance final policies. Remarkably, our method achieves zero-shot sim-to-real transfer. Our experimental results indicate that SATA demonstrates remarkable compliance and safety, even in challenging environments such as soft/slippery terrain or narrow passages, and under significant external disturbances, highlighting its potential for practical deployments in human-centric and safety-critical scenarios.
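The difference between the two action spaces described in the abstract can be made concrete with a minimal sketch. This is an illustration, not SATA's implementation: the function names, the gains `kp` and `kd`, the torque limit `tau_max`, and the scaling of normalized policy actions are all assumptions chosen for the example. In the position-based pipeline, the policy's target joint angles pass through a PD law before reaching the motors; in the torque-based pipeline, the policy output is mapped to joint torques directly.

```python
from typing import List, Sequence


def clip(value: float, limit: float) -> float:
    """Saturate a value to the symmetric range [-limit, limit]."""
    return max(-limit, min(limit, value))


def pd_torques(q_target: Sequence[float], q: Sequence[float],
               qd: Sequence[float], kp: float, kd: float,
               tau_max: float) -> List[float]:
    """Position-based pipeline: a low-level PD controller turns the
    policy's target joint angles into torques.

    tau = kp * (q_target - q) - kd * qd, then saturated at tau_max.
    The gains and the limit are hypothetical example values.
    """
    return [clip(kp * (qt - qi) - kd * qdi, tau_max)
            for qt, qi, qdi in zip(q_target, q, qd)]


def direct_torques(action: Sequence[float], tau_max: float) -> List[float]:
    """Torque-based pipeline: the policy output is the torque command.

    Assumes (for illustration only) that actions are normalized and
    scaled by tau_max before saturation.
    """
    return [clip(a * tau_max, tau_max) for a in action]


if __name__ == "__main__":
    # A 0.5 rad tracking error with kp = 20 yields a 10 N·m command.
    print(pd_torques([0.5], [0.0], [0.0], kp=20.0, kd=0.5, tau_max=30.0))
    # The same policy output interpreted directly as a scaled torque.
    print(direct_torques([0.5], tau_max=30.0))
```

In the PD case the commanded torque grows with the tracking error, so an unexpected obstacle that blocks a joint produces a larger push until the limit is reached. In the direct case the torque is whatever the policy chooses, which is the property the abstract associates with compliance.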
