Transformer-Powered Surrogates Close the ICF Simulation-Experiment Gap with Extremely Limited Data
Matthew L. Olson, Shusen Liu, Jayaraman J. Thiagarajan, Bogdan Kustowski, Weng-Keen Wong, Rushil Anirudh
TL;DR
This work tackles bridging the simulation–experiment gap in inertial confinement fusion under extremely limited real data. It introduces a transformer-based surrogate trained on large-scale simulations and refined with masked autoencoding, coupled with a novel graph-based hyper-parameter optimization to combat overfitting and noisy validation signals. The methodology enables effective multi-modal transfer learning (scalars and X-ray images) and demonstrates a substantial reduction in predictive error, notably a ~40% relative improvement over state-of-the-art neural surrogates, with robust performance across real and synthetic benchmarks. The approach offers a scalable path for accurate, data-efficient predictions in physics-guided ML and can be extended to other domains with similar simulation–experiment gaps.
Abstract
Recent advances in machine learning, specifically transformer architecture, have led to significant advancements in commercial domains. These powerful models have demonstrated superior capability to learn complex relationships and often generalize better to new data and problems. This paper presents a novel transformer-powered approach for enhancing prediction accuracy in multi-modal output scenarios, where sparse experimental data is supplemented with simulation data. The proposed approach integrates transformer-based architecture with a novel graph-based hyper-parameter optimization technique. The resulting system not only effectively reduces simulation bias, but also achieves superior prediction accuracy compared to the prior method. We demonstrate the efficacy of our approach on inertial confinement fusion experiments, where only 10 shots of real-world data are available, as well as synthetic versions of these experiments.
