Mitigating over-exploration in latent space optimization using LES
Omer Ronen, Ahmed Imtiaz Humayun, Richard Baraniuk, Randall Balestriero, Bin Yu
TL;DR
Latent Space Optimization for discrete, structured problems often yields invalid or impractical solutions due to over-exploration. The paper introduces Latent Exploration Score ($LES$), a differentiable, decoder-centric density-based constraint derived from the push-forward sequence density through a CPA-represented decoder, enabling seamless integration into LSO without retraining. Empirical results across five benchmark tasks and 22 VAEs show that $LES$ consistently improves solution validity and objective quality, with AUROC analyses confirming reliable identification of valid latent regions and with competitive or superior performance to multiple baselines. The method is computationally intensive due to determinant calculations but demonstrates strong robustness and portability across architectures, offering a practical path to more realistic generative-discovery in discrete spaces; open-source code is provided.
Abstract
We develop Latent Exploration Score (LES) to mitigate over-exploration in Latent Space Optimization (LSO), a popular method for solving black-box discrete optimization problems. LSO utilizes continuous optimization within the latent space of a Variational Autoencoder (VAE) and is known to be susceptible to over-exploration, which manifests in unrealistic solutions that reduce its practicality. LES leverages the trained decoder's approximation of the data distribution, and can be employed with any VAE decoder - including pretrained ones - without additional training, architectural changes or access to the training data. Our evaluation across five LSO benchmark tasks and twenty-two VAE models demonstrates that LES always enhances the quality of the solutions while maintaining high objective values, leading to improvements over existing solutions in most cases. We believe that new avenues to LSO will be opened by LES' ability to identify out of distribution areas, differentiability, and computational tractability. Open source code for LES is available at https://github.com/OmerRonen/les.
