Learning to Simulate: Generative Metamodeling via Quantile Regression
L. Jeff Hong, Yanxi Hou, Qingkai Zhang, Xiaowei Zhang
TL;DR
This work introduces generative metamodeling to bridge the gap between slow stochastic simulators and the need for real-time decision support. It proposes QRGMM, which learns the conditional quantile function $F_Y^{-1}(\tau|\mathbf{x})$ offline on a grid and rapidly generates samples online via interpolation of $\hat{Q}(\tau|\mathbf{x})$, yielding approximate samples from $Y(\mathbf{x})$ while preserving distributional fidelity. A rigorous convergence analysis leverages a novel tail-partition scheme to establish uniform convergence of $\hat{Y}(\mathbf{x})$ to $Y(\mathbf{x})$ as $n,m\to\infty$; practical guidance shows $m=O(\sqrt{n})$ balances estimation and interpolation errors and mitigates quantile crossing through rearrangement. Extensions to multidimensional outputs and nonlinear quantile models broaden applicability, and numerical experiments—including artificial benchmarks and an esophageal cancer treatment simulator—demonstrate QRGMM’s superior distributional accuracy and dramatic online generation speed relative to competing generative models. Collectively, the framework enables real-time computation of arbitrary statistics from the simulator’s conditional distribution, with potential impact across operations research, healthcare, and finance.
Abstract
Stochastic simulation models effectively capture complex system dynamics but are often too slow for real-time decision-making. Traditional metamodeling techniques learn relationships between simulator inputs and a single output summary statistic, such as the mean or median. These techniques enable real-time predictions without additional simulations. However, they require prior selection of one appropriate output summary statistic, limiting their flexibility in practical applications. We propose a new concept: generative metamodeling. It aims to construct a "fast simulator of the simulator," generating random outputs significantly faster than the original simulator while preserving approximately equal conditional distributions. Generative metamodels enable rapid generation of numerous random outputs upon input specification, facilitating immediate computation of any summary statistic for real-time decision-making. We introduce a new algorithm, quantile-regression-based generative metamodeling (QRGMM), and establish its distributional convergence and convergence rate. Extensive numerical experiments demonstrate QRGMM's efficacy compared to other state-of-the-art generative algorithms in practical real-time decision-making scenarios.
