FoMEMO: Towards Foundation Models for Expensive Multi-objective Optimization
Yiming Yao, Fei Liu, Liang Zhao, Xi Lin, Yilu Liu, Qingfu Zhang
TL;DR
FoMEMO proposes a foundation-model framework for expensive multi-objective optimization by pre-training a transformer-based Prior-Data Fitted Network on hundreds of millions of synthetic trajectories to predict aggregated posteriors conditioned on domain context and user preferences. In the in-context phase, the model enables fast candidate generation via preference-based or preference-free acquisition functions without any further model updates, achieving high efficiency and strong generalization across unseen problems. Key contributions include synthetic data generation for broad problem coverage, objective-aware regression heads, and two acquisition families (EI/UCB and UHVI/UR2I) that operate on aggregated posteriors. The approach demonstrates competitive or superior performance and scalable runtime behavior across synthetic, engineering-design, and HPO tasks, offering a practical, adaptable paradigm for real-world MOBO in diverse domains.
Abstract
Expensive multi-objective optimization is a prevalent and crucial concern in many real-world scenarios, where sample-efficiency is vital due to the limited evaluations to recover the true Pareto front for decision making. Existing works either involve rebuilding Gaussian process surrogates from scratch for each objective in each new problem encountered, or rely on extensive past domain experiments for pre-training deep learning models, making them hard to generalize and impractical to cope with various emerging applications in the real world. To address this issue, we propose a new paradigm named FoMEMO (Foundation Models for Expensive Multi-objective Optimization), which enables the establishment of a foundation model conditioned on any domain trajectory and user preference, and facilitates fast in-context optimization based on the predicted preference-wise aggregated posteriors. Rather than accessing extensive real-world domain experiments for training, we demonstrate that pre-training the foundation model with a diverse set of hundreds of millions of synthetic data can lead to superior generalization and optimization performance to unknown problems, without necessitating any subsequent model training or updates in the following optimization process.
