Parametric-Task MAP-Elites
Timothée Anne, Jean-Baptiste Mouret
TL;DR
Parametric-Task MAP-Elites (PT-ME) tackles continuous multi-task optimization by learning a mapping $G: \Theta \rightarrow \mathcal{X}$ that returns the optimal solution $x^*_{\theta}$ for any task parameter $\theta$. It achieves this by sampling a new task each iteration, employing a dual variation strategy (SBX with a bandit-tuned tournament and a local linear-regression operator), and storing all evaluations for distillation into a neural network predictor. Across 10-DoF Arm, Archery, and Door-Pulling, PT-ME outperforms baselines including PPO, CMA-ES, MT-ME, and ablations in both coverage (MR-QD-Score) and solution quality, demonstrating scalable, data-efficient parametric-task learning. The approach enables fast inference on unseen tasks via distillation and suggests directions for scaling to larger task spaces and alternative regression models.
Abstract
Optimizing a set of functions simultaneously by leveraging their similarity is called multi-task optimization. Current black-box multi-task algorithms only solve a finite set of tasks, even when the tasks originate from a continuous space. In this paper, we introduce Parametric-Task MAP-Elites (PT-ME), a new black-box algorithm for continuous multi-task optimization problems. This algorithm (1) solves a new task at each iteration, effectively covering the continuous space, and (2) exploits a new variation operator based on local linear regression. The resulting dataset of solutions makes it possible to create a function that maps any task parameter to its optimal solution. We show that PT-ME outperforms all baselines, including the deep reinforcement learning algorithm PPO on two parametric-task toy problems and a robotic problem in simulation.
