ConfigX: Modular Configuration for Evolutionary Algorithms via Multitask Reinforcement Learning
Hongshu Guo, Zeyuan Ma, Jiacheng Chen, Yining Ma, Zhiguang Cao, Xinglin Zhang, Yue-Jiao Gong
TL;DR
ConfigX introduces a universal MetaBBO agent capable of configuring a broad space of evolutionary algorithms through Modular-BBO modularization and a Transformer-based multitask RL policy. By defining a joint optimization task space and leveraging module-aware attention, it learns a policy that generalizes to unseen EAs and problem instances, achieving strong zero-shot performance and efficient lifelong adaptation via fine-tuning. The approach demonstrates competitive to superior performance against SMAC3, with ablations highlighting the importance of architectural components such as MSA and positional encoding. This work advances automatic, all-purpose configuration for EAs, enabling scalable, data-driven EA design across diverse BBO tasks.
Abstract
Recent advances in Meta-learning for Black-Box Optimization (MetaBBO) have shown the potential of using neural networks to dynamically configure evolutionary algorithms (EAs), enhancing their performance and adaptability across various BBO instances. However, they are often tailored to a specific EA, which limits their generalizability and necessitates retraining or redesigns for different EAs and optimization problems. To address this limitation, we introduce ConfigX, a new paradigm of the MetaBBO framework that is capable of learning a universal configuration agent (model) for boosting diverse EAs. To achieve so, our ConfigX first leverages a novel modularization system that enables the flexible combination of various optimization sub-modules to generate diverse EAs during training. Additionally, we propose a Transformer-based neural network to meta-learn a universal configuration policy through multitask reinforcement learning across a designed joint optimization task space. Extensive experiments verify that, our ConfigX, after large-scale pre-training, achieves robust zero-shot generalization to unseen tasks and outperforms state-of-the-art baselines. Moreover, ConfigX exhibits strong lifelong learning capabilities, allowing efficient adaptation to new tasks through fine-tuning. Our proposed ConfigX represents a significant step toward an automatic, all-purpose configuration agent for EAs.
