An Extended Horizon Tactical Decision-Making for Automated Driving Based on Monte Carlo Tree Search
Karim Essalmi, Fernando Garrido, Fawzi Nashashibi
TL;DR
The paper addresses long-horizon decision-making for automated driving where fixed planning horizons limit safety and efficiency, proposing COR-MCTS to extend horizon planning. It integrates Monte Carlo Tree Search with the COR-MP utility-based maneuver planner, using nodes that store $v$, $m$, $U$, and $UCB$, and an action set that combines lateral and longitudinal maneuvers. Through Selection, Expansion, Simulation, and Backpropagation steps, the method evaluates maneuver sequences and selects the action maximizing accumulated value under a budget of iterations, with $\gamma=0.9$ and $c=\sqrt{2}$. Simulations in two scenarios show COR-MCTS avoids unsafe decisions that plague fixed-horizon planners and achieves real-time performance, especially when pruning is used (median runtimes: $51.9$ ms for COR-MP, $113.88$ ms for pruned COR-MCTS, and $195.56$ ms for unpruned COR-MCTS). The work advances human-like, long-horizon tactical decision-making for autonomous driving and highlights trade-offs between horizon length, computation, and uncertainty, with plans for real-vehicle validation and interaction-aware extensions.
Abstract
This paper introduces COR-MCTS (Conservation of Resources - Monte Carlo Tree Search), a novel tactical decision-making approach for automated driving focusing on maneuver planning over extended horizons. Traditional decision-making algorithms are often constrained by fixed planning horizons, typically up to 6 seconds for classical approaches and 3 seconds for learning-based methods limiting their adaptability in particular dynamic driving scenarios. However, planning must be done well in advance in environments such as highways, roundabouts, and exits to ensure safe and efficient maneuvers. To address this challenge, we propose a hybrid method integrating Monte Carlo Tree Search (MCTS) with our prior utility-based framework, COR-MP (Conservation of Resources Model for Maneuver Planning). This combination enables long-term, real-time decision-making, significantly enhancing the ability to plan a sequence of maneuvers over extended horizons. Through simulations across diverse driving scenarios, we demonstrate that COR-MCTS effectively improves planning robustness and decision efficiency over extended horizons.
