Tree-Structured Parzen Estimator: Understanding Its Algorithm Components and Their Roles for Better Empirical Performance
Shuhei Watanabe
TL;DR
The paper demystifies the Tree-structured Parzen Estimator by systematically abating its components and showing how each control parameter influences exploitation versus exploration. It provides a comprehensive set of default, empirically validated settings, including multivariate KDEs, priors, and bandwidth strategies, and demonstrates robust performance improvements over traditional TPE implementations and several baselines. The work emphasizes noise-aware bandwidth selection and presents practical guidelines for applying TPE across diverse benchmark and real-world HPO tasks. Overall, it offers actionable insights to tune TPE for better empirical performance with clear recommendations and thorough experimental validation.
Abstract
Recent scientific advances require complex experiment design, necessitating the meticulous tuning of many experiment parameters. Tree-structured Parzen estimator (TPE) is a widely used Bayesian optimization method in recent parameter tuning frameworks such as Hyperopt and Optuna. Despite its popularity, the roles of each control parameter in TPE and the algorithm intuition have not been discussed so far. The goal of this paper is to identify the roles of each control parameter and their impacts on parameter tuning based on the ablation studies using diverse benchmark datasets. The recommended setting concluded from the ablation studies is demonstrated to improve the performance of TPE. Our TPE implementation used in this paper is available at https://github.com/nabenabe0928/tpe/tree/single-opt.
