Monte Carlo Search Algorithms Discovering Monte Carlo Tree Search Exploration Terms
Tristan Cazenave
TL;DR
The paper tackles improving Monte Carlo Tree Search by discovering new root exploration terms through an empirical, expression-based search. It introduces an expression discovery game and an AMAF-based sampling approach to rapidly evaluate candidate terms, aiming to enhance PUCT and SHUSS, especially under small budgeting. The authors demonstrate that the discovered terms can achieve competitive performance against standard PUCT and improve SHUSS, supported by a Go-focused experimental pipeline including a Katago Go dataset and a fast SHUSS evaluation dataset. The work offers a simple, scalable method for AI-driven algorithm design with potential applicability beyond Go to other decision problems and domains.
Abstract
Monte Carlo Tree Search and Monte Carlo Search have good results for many combinatorial problems. In this paper we propose to use Monte Carlo Search to design mathematical expressions that are used as exploration terms for Monte Carlo Tree Search algorithms. The optimized Monte Carlo Tree Search algorithms are PUCT and SHUSS. We automatically design the PUCT and the SHUSS root exploration terms. For small search budgets of 32 evaluations the discovered root exploration terms make both algorithms competitive with usual PUCT.
