Deep adaptive sampling for surrogate modeling without labeled data
Xili Wang, Kejun Tang, Jiayu Zhai, Xiaoliang Wan, Chao Yang
TL;DR
This work tackles surrogate modeling for parametric differential equations in the absence of labeled data. It introduces $\mathrm{DAS}^2$, a physics-informed framework that uses a KRnet-based deep generative model to approximate the residual-induced sampling distribution and to adaptively generate collocation points in both space and parameter domains, thereby reducing discretization (statistical) error for low-regularity problems. The method is validated across multiple settings, including a parametric ODE, high-dimensional operator learning, a geometry-driven optimal control problem, and parametric lid-driven cavity flows, consistently outperforming uniform, RAR, and QRS strategies with faster convergence and competitive inference times. The results demonstrate that deep generative, residual-guided sampling can significantly improve all-at-once surrogates for high-dimensional, low-regularity PDEs without needing labeled simulation data, offering a practical path to rapid, reliable uncertainty quantification and design under parametric variation.
Abstract
Surrogate modeling is of great practical significance for parametric differential equation systems. In contrast to classical numerical methods, using physics-informed deep learning methods to construct simulators for such systems is a promising direction due to its potential to handle high dimensionality, which requires minimizing a loss over a training set of random samples. However, the random samples introduce statistical errors, which may become the dominant errors for the approximation of low-regularity and high-dimensional problems. In this work, we present a deep adaptive sampling method for surrogate modeling ($\text{DAS}^2$), where we generalize the deep adaptive sampling (DAS) method [62] [Tang, Wan and Yang, 2023] to build surrogate models for low-regularity parametric differential equations. In the parametric setting, the residual loss function can be regarded as an unnormalized probability density function (PDF) of the spatial and parametric variables. This PDF is approximated by a deep generative model, from which new samples are generated and added to the training set. Since the new samples match the residual-induced distribution, the refined training set can further reduce the statistical error in the current approximate solution. We demonstrate the effectiveness of $\text{DAS}^2$ with a series of numerical experiments, including the parametric lid-driven 2D cavity flow problem with a continuous range of Reynolds numbers from 100 to 1000.
