Table of Contents
Fetching ...

Assessing the performance of correlation-based multi-fidelity neural emulators

Cristian J. Villatoro, Gianluca Geraci, Daniele E. Schiavazzi

TL;DR

This study systematically evaluates three neural architectures (MLP, Siren, KAN) for multi-fidelity emulation, integrating diverse low-fidelity sources with limited high-fidelity data. It introduces coordinate encoding to reconcile differing input spaces and compares standard versus encoded MF networks across 1D/2D, PDE, and high-dimensional problems, including discontinuities and phase shifts. Key findings show that MF frameworks typically outperform HF-only surrogates, with KAN delivering the strongest performance on complex and high-dimensional tasks, while coordinate encoding is most beneficial when LF–HF relationships exhibit misalignment or discontinuities. The results offer guidance on architecture choice and encoding strategies for efficient, accurate HF surrogates in computationally expensive scientific applications.

Abstract

Outer loop tasks such as optimization, uncertainty quantification or inference can easily become intractable when the underlying high-fidelity model is computationally expensive. Similarly, data-driven architectures typically require large datasets to perform predictive tasks with sufficient accuracy. A possible approach to mitigate these challenges is the development of multi-fidelity emulators, leveraging potentially biased, inexpensive low-fidelity information while correcting and refining predictions using scarce, accurate high-fidelity data. This study investigates the performance of multi-fidelity neural emulators, neural networks designed to learn the input-to-output mapping by integrating limited high-fidelity data with abundant low-fidelity model solutions. We investigate the performance of such emulators for low and high-dimensional functions, with oscillatory character, in the presence of discontinuities, for collections of models with equal and dissimilar parametrization, and for a possibly large number of potentially corrupted low-fidelity sources. In doing so, we consider a large number of architectural, hyperparameter, and dataset configurations including networks with a different amount of spectral bias (Multi-Layered Perceptron, Siren and Kolmogorov Arnold Network), various mechanisms for coordinate encoding, exact or learnable low-fidelity information, and for varying training dataset size. We further analyze the added value of the multi-fidelity approach by conducting equivalent single-fidelity tests for each case, quantifying the performance gains achieved through fusing multiple sources of information.

Assessing the performance of correlation-based multi-fidelity neural emulators

TL;DR

This study systematically evaluates three neural architectures (MLP, Siren, KAN) for multi-fidelity emulation, integrating diverse low-fidelity sources with limited high-fidelity data. It introduces coordinate encoding to reconcile differing input spaces and compares standard versus encoded MF networks across 1D/2D, PDE, and high-dimensional problems, including discontinuities and phase shifts. Key findings show that MF frameworks typically outperform HF-only surrogates, with KAN delivering the strongest performance on complex and high-dimensional tasks, while coordinate encoding is most beneficial when LF–HF relationships exhibit misalignment or discontinuities. The results offer guidance on architecture choice and encoding strategies for efficient, accurate HF surrogates in computationally expensive scientific applications.

Abstract

Outer loop tasks such as optimization, uncertainty quantification or inference can easily become intractable when the underlying high-fidelity model is computationally expensive. Similarly, data-driven architectures typically require large datasets to perform predictive tasks with sufficient accuracy. A possible approach to mitigate these challenges is the development of multi-fidelity emulators, leveraging potentially biased, inexpensive low-fidelity information while correcting and refining predictions using scarce, accurate high-fidelity data. This study investigates the performance of multi-fidelity neural emulators, neural networks designed to learn the input-to-output mapping by integrating limited high-fidelity data with abundant low-fidelity model solutions. We investigate the performance of such emulators for low and high-dimensional functions, with oscillatory character, in the presence of discontinuities, for collections of models with equal and dissimilar parametrization, and for a possibly large number of potentially corrupted low-fidelity sources. In doing so, we consider a large number of architectural, hyperparameter, and dataset configurations including networks with a different amount of spectral bias (Multi-Layered Perceptron, Siren and Kolmogorov Arnold Network), various mechanisms for coordinate encoding, exact or learnable low-fidelity information, and for varying training dataset size. We further analyze the added value of the multi-fidelity approach by conducting equivalent single-fidelity tests for each case, quantifying the performance gains achieved through fusing multiple sources of information.

Paper Structure

This paper contains 33 sections, 26 equations, 40 figures, 2 tables.

Figures (40)

  • Figure 1: Schematic of standard and encoded multi-fidelity neural networks.
  • Figure 2: Mean normalized HF MSE results for example K1.
  • Figure 3: Best HF and LF model fitting results plotted against using 8/64 HF/LF training samples and an exact LF model for a Siren network with no encoding (left) with the learned linear/nonlinear HF correlation (right).
  • Figure 4: Mean normalized HF MSE over 4 repetitions for example K2 using various network configurations.
  • Figure 5: HF and LF model fitting results for 32/256 HF/LF training samples, and exact LF predictor. We consider a network with no encoding (left), linear encoding (middle), and nonlinear encoding (right) for example K2.
  • ...and 35 more figures