Learning feasible transitions for efficient contact planning
Rikhat Akizhanov, Victor Dhédin, Majid Khadiv, Ivan Laptev
TL;DR
This work tackles dynamic contact planning for quadrupedal locomotion in extremely constrained stepping-stone environments by marrying Monte Carlo Tree Search with learned components. A dynamic feasibility classifier $c$ and a target adjustment network $t$ are trained offline from NMPC data to prune infeasible transitions and compensate low-level controller inaccuracies, enabling faster online search. Empirical results on a Go2 quadruped show that dynamic pruning and target adjustment substantially reduce NMPC calls and search time while increasing success rates, outperforming a state-of-the-art RL approach on the smallest stones. The approach demonstrates that integrating learning-based pruning and control compensation into a model-based planning loop yields robust, efficient navigation in sparse terrains with potential for real-robot deployment.
Abstract
In this paper, we propose an efficient contact planner for quadrupedal robots to navigate in extremely constrained environments such as stepping stones. The main difficulty in this setting stems from the mixed nature of the problem, namely discrete search over the steppable patches and continuous trajectory optimization. To speed up the discrete search, we study the properties of the transitions from one contact mode to another. In particular, we propose to learn a dynamic feasibility classifier and a target adjustment network. The former predicts if a contact transition between two contact modes is dynamically feasible. The latter is trained to compensate for misalignment in reaching a desired set of contact locations, due to imperfections of the low-level control. We integrate these learned networks in a Monte Carlo Tree Search (MCTS) contact planner. Our simulation results demonstrate that training these networks with offline data significantly speeds up the online search process and improves its accuracy.
