PearSAN: A Machine Learning Method for Inverse Design using Pearson Correlated Surrogate Annealing
Michael Bezick, Blake A. Wilson, Vaishnavi Iyer, Yuheng Chen, Vladimir M. Shalaev, Sabre Kais, Alexander V. Kildishev, Alexandra Boltasseva, Brad Lackey
TL;DR
PearSON Correlated Surrogate Annealing (PearSAN) tackles inverse design in high-dimensional metasurface problems by combining a discretized latent space from a pretrained generator with a surrogate objective trained via a Pearson correlation-based loss (PearSOL). The framework uses variational neural annealing (VCA) to sample latent vectors in a way that is antitonic to the true figure-of-merit (FOM), enabling efficient discovery of high-performance designs without retraining the outer decoder. Empirically, PearSAN outperforms energy-matching surrogates and prior ML-based or topology-optimization methods, achieving a maximum design efficiency around 97% and orders-of-magnitude faster sample generation for TPV emitter metasurface designs. The approach is generalizable to any pretrained discretized latent-space generator and offers substantial speedups and design-quality gains for complex, physics-constrained inverse design tasks, with potential extensions to other energy models and continuous latent spaces.
Abstract
PearSAN is a machine learning-assisted optimization algorithm applicable to inverse design problems with large design spaces, where traditional optimizers struggle. The algorithm leverages the latent space of a generative model for rapid sampling and employs a Pearson correlated surrogate model to predict the figure of merit of the true design metric. As a showcase example, PearSAN is applied to thermophotovoltaic (TPV) metasurface design by matching the working bands between a thermal radiator and a photovoltaic cell. PearSAN can work with any pretrained generative model with a discretized latent space, making it easy to integrate with VQ-VAEs and binary autoencoders. Its novel Pearson correlational loss can be used as both a latent regularization method, similar to batch and layer normalization, and as a surrogate training loss. We compare both to previous energy matching losses, which are shown to enforce poor regularization and performance, even with upgraded affine parameters. PearSAN achieves a state-of-the-art maximum design efficiency of 97%, and is at least an order of magnitude faster than previous methods, with an improved maximum figure-of-merit gain.
