Towards Safe Maneuvering of Double-Ackermann-Steering Robots with a Soft Actor-Critic Framework
Kohio Deflesselle, Mélodie Daniel, Aly Magassouba, Miguel Aranda, Olivier Ly
TL;DR
This work tackles the problem of safe maneuvering for double-Ackermann-steering robots under non-holonomic constraints. It introduces a Deep Reinforcement Learning framework based on Soft Actor-Critic (SAC) with Hindsight Experience Replay (HER) and CrossQ, enabling obstacle avoidance without handcrafted trajectories. The state and action definitions, plus a sparse but safety-conscious reward, yield robust performance; in simulation the method reaches up to 97% success on seen targets and 95% on unseen targets, with improvements in path efficiency (SPL). The results suggest strong potential for real-world deployment, with future work aimed at validation in cluttered real environments.
Abstract
We present a deep reinforcement learning framework based on Soft Actor-Critic (SAC) for safe and precise maneuvering of double-Ackermann-steering mobile robots (DASMRs). Unlike holonomic or simpler non-holonomic robots such as differential-drive robots, DASMRs face strong kinematic constraints that make classical planners brittle in cluttered environments. Our framework leverages the Hindsight Experience Replay (HER) and the CrossQ overlay to encourage maneuvering efficiency while avoiding obstacles. Simulation results with a heavy four-wheel-steering rover show that the learned policy can robustly reach up to 97% of target positions while avoiding obstacles. Our framework does not rely on handcrafted trajectories or expert demonstrations.
