Local Path Planning among Pushable Objects based on Reinforcement Learning
Linghong Yao, Valerio Modugno, Andromachi Maria Delfaki, Yuanchang Liu, Danail Stoyanov, Dimitrios Kanoulas
TL;DR
The paper tackles local path planning among pushable obstacles (NAMO) by learning a non-axis-aligned pushing policy via Advantage Actor-Critic in parallel simulated agents using NVIDIA Isaac Gym, with domain randomization to bridge sim-to-real gaps. The state combines a semantic occupancy grid and a feature vector; the network outputs both value and continuous actions $(v_x, \\dot{\theta})$ through a shared backbone, trained with a clipped surrogate objective and entropy regularization. A curriculum learning strategy across eight map layouts and robust sim-to-real transfer are demonstrated, showing high success in varied simulations (up to 91% in single-map, ~80% across maps, 54% in unseen maps) and successful real-world tests on a Unitree Go1. The work highlights non-linear obstacle manipulation and the potential to integrate with global planners like A* to enhance navigation in cluttered, uncertain environments.
Abstract
In this paper, we introduce a method to deal with the problem of robot local path planning among pushable objects -- an open problem in robotics. In particular, we achieve that by training multiple agents simultaneously in a physics-based simulation environment, utilizing an Advantage Actor-Critic algorithm coupled with a deep neural network. The developed online policy enables these agents to push obstacles in ways that are not limited to axial alignments, adapt to unforeseen changes in obstacle dynamics instantaneously, and effectively tackle local path planning in confined areas. We tested the method in various simulated environments to prove the adaptation effectiveness to various unseen scenarios in unfamiliar settings. Moreover, we have successfully applied this policy on an actual quadruped robot, confirming its capability to handle the unpredictability and noise associated with real-world sensors and the inherent uncertainties present in unexplored object pushing tasks.
