Feasibility Study on Active Learning of Smart Surrogates for Scientific Simulations
Pradeep Bajracharya, Javier Quetzalcóatl Toledo-Marín, Geoffrey Fox, Shantenu Jha, Linwei Wang
TL;DR
High-performance diffusion simulations incur heavy data-generation costs across large parameter spaces. The authors integrate active learning to train DNN surrogates that selectively query simulations, reducing labeled data needs. In offline emulations of a two-source diffusion problem on a $100\times100$ lattice, TOD-based acquisition with a suitable architecture (notably U-Net) achieved lower weighted MAE with less labeled data than baselines, while the benefits depend strongly on the network design. This work lays a foundation for Smart Surrogates in HPC, enabling on-the-fly data generation steered by active learning to accelerate complex scientific computations.
Abstract
High-performance scientific simulations, important for comprehension of complex systems, encounter computational challenges especially when exploring extensive parameter spaces. There has been an increasing interest in developing deep neural networks (DNNs) as surrogate models capable of accelerating the simulations. However, existing approaches for training these DNN surrogates rely on extensive simulation data which are heuristically selected and generated with expensive computation -- a challenge under-explored in the literature. In this paper, we investigate the potential of incorporating active learning into DNN surrogate training. This allows intelligent and objective selection of training simulations, reducing the need to generate extensive simulation data as well as the dependency of the performance of DNN surrogates on pre-defined training simulations. In the problem context of constructing DNN surrogates for diffusion equations with sources, we examine the efficacy of diversity- and uncertainty-based strategies for selecting training simulations, considering two different DNN architecture. The results set the groundwork for developing the high-performance computing infrastructure for Smart Surrogates that supports on-the-fly generation of simulation data steered by active learning strategies to potentially improve the efficiency of scientific simulations.
