Diversity By Design: Leveraging Distribution Matching for Offline Model-Based Optimization
Michael S. Yao, James C. Gee, Osbert Bastani
TL;DR
Diversity is critical in offline model-based optimization to capture multiple high-quality design configurations. DynAMO treats diversity as distribution matching between the generator's design distribution and a temperature-weighted reference from the offline dataset, augmented with an adversarial critic constraint to bound out-of-distribution evaluation. By deriving a closed-form dual via Lagrangian optimization, DynAMO remains compatible with a wide range of optimizers and tasks, demonstrated across diverse Design-Bench problems and Molecule design. Empirically, DynAMO yields substantial gains in candidate diversity while maintaining strong quality, though trade-offs emerge with hyperparameters and certain metrics. The approach offers a scalable, task- and optimizer-agnostic path to diversify offline design exploration and supports downstream multi-objective decision-making in scientific domains.
Abstract
The goal of offline model-based optimization (MBO) is to propose new designs that maximize a reward function given only an offline dataset. However, an important desiderata is to also propose a diverse set of final candidates that capture many optimal and near-optimal design configurations. We propose Diversity in Adversarial Model-based Optimization (DynAMO) as a novel method to introduce design diversity as an explicit objective into any MBO problem. Our key insight is to formulate diversity as a distribution matching problem where the distribution of generated designs captures the inherent diversity contained within the offline dataset. Extensive experiments spanning multiple scientific domains show that DynAMO can be used with common optimization methods to significantly improve the diversity of proposed designs while still discovering high-quality candidates.
