Worst-Case Learning under a Multi-fidelity Model
Simon Foucart, Nicolas Hengartner
TL;DR
This work reframes multi-fidelity surrogate design as an Optimal Recovery problem to obtain deterministic, worst-case error bounds. By modeling high- and low-fidelity codes with sets $\mathcal{K}_0$ and $\mathcal{K}_1$ and data constraints, it derives both global and local recovery strategies: (i) globally optimal estimation of linear functionals via linear, efficiently computable coefficient maps; (ii) globally optimal recovery in Hilbert spaces using a constrained regularization parameter $\tau^{\sharp}$ solved by SDP; and (iii) a Chebyshev-center analysis for the intersection of two hyperellipsoids to bound and certify local radius of information. It further introduces locally optimal recovery in Hilbert spaces with data-dependent regularization parameters, and provides SDP formulations to compute optimal arms, including cases where the two-fidelity setting suffices or where S-procedure limitations arise. The approach yields explicit, verifiable, deterministic uncertainty bounds for multi-fidelity surrogates and offers scalable, sparse recovery procedures with potential extensions to time-series and transfer-learning contexts.
Abstract
Inspired by multi-fidelity methods in computer simulations, this article introduces procedures to design surrogates for the input/output relationship of a high-fidelity code. These surrogates should be learned from runs of both the high-fidelity and low-fidelity codes and be accompanied by error guarantees that are deterministic rather than stochastic. For this purpose, the article advocates a framework tied to a theory focusing on worst-case guarantees, namely Optimal Recovery. The multi-fidelity considerations triggered new theoretical results in three scenarios: the globally optimal estimation of linear functionals, the globally optimal approximation of arbitrary quantities of interest in Hilbert spaces, and their locally optimal approximation, still within Hilbert spaces. The latter scenario boils down to the determination of the Chebyshev center for the intersection of two hyperellipsoids. It is worth noting that the mathematical framework presented here, together with its possible extension, seems to be relevant in several other contexts briefly discussed.
