Optimal ANOVA-based emulators of models with(out) derivatives
Matieyendou Lamboni
TL;DR
The paper develops derivative-based and derivative-free ANOVA-based emulators (Db-ANOVA) for high-dimensional models by exploiting Sobol' sensitivity indices to structure truncations and interactions. Derivative-based emulators yield dimension-free MSEs and a parametric convergence rate of $O(N^{-1})$ when cross-partial derivatives are available, while derivative-free surrogates extend to general input distributions with explicit bias and MSE bounds. It introduces stochastic surrogates, provides rigorous bias and MSE analyses under Hölder smoothness and general input distributions, and demonstrates the methods on Ishigami and Sobol' g-functions as well as a heat-diffusion PDE with stochastic initial conditions. The results show both approaches can efficiently approximate complex simulators, with derivative-based emulators offering stronger theoretical guarantees in high dimensions and derivative-free emulators providing robust performance with tractable sample requirements. The work also outlines practical truncation strategies and local emulator possibilities, pointing to future enhancements and broader applicability to empirical data.
Abstract
This paper proposes new ANOVA-based approximations of functions and emulators of high-dimensional models using either available derivatives or local stochastic evaluations of such models. Our approach makes use of sensitivity indices to design adequate structures of emulators. For high-dimensional models with available derivatives, our derivative-based emulators reach dimension-free mean squared errors (MSEs) and parametric rate of convergence (i.e., $\mathsf{O}(N^{-1})$). This approach is extended to cope with every model (without available derivatives) by deriving global emulators that account for the local properties of models or simulators. Such generic emulators enjoy dimension-free biases, parametric rates of convergence and MSEs that depend on the dimensionality. Dimension-free MSEs are obtained for high-dimensional models with particular inputs' distributions. Our emulators are also competitive in dealing with different distributions of the input variables and for selecting inputs and interactions. Simulations show the efficiency of our approach.
