Convolution Operator Network for Forward and Inverse Problems (FI-Conv): Application to Plasma Turbulence Simulations
Xingzhuo Chen, Anthony Poole, Ionut-Gabriel Farcas, David R. Hatch, Ulisses Braga-Neto
TL;DR
FI-Conv introduces a Convolutional Operator Network that unifies forward prediction and inverse PDE parameter estimation within a single, autoregressive framework. By embedding initial state, PDE parameters, and evolution time into a U-Net equipped with ConvNeXt V2 blocks, the method achieves accurate short-term forecasts and statistically faithful long-term behavior for turbulent HW plasma dynamics, while enabling gradient-based parameter inference without retraining. The approach is demonstrated on a multi-parameter HW model, showing competitive forward accuracy, preservation of high-frequency structures, and effective inverse estimation for multiple parameters, with k0 and cpb being most readily inferred. The results suggest FI-Conv as a scalable, data-driven surrogate for complex, multiscale PDEs and a practical tool for plasma diagnostics and optimization in tokamak-like systems.
Abstract
We propose the Convolutional Operator Network for Forward and Inverse Problems (FI-Conv), a framework capable of predicting system evolution and estimating parameters in complex spatio-temporal dynamics, such as turbulence. FI-Conv is built on a U-Net architecture, in which most convolutional layers are replaced by ConvNeXt V2 blocks. This design preserves U-Net performance on inputs with high-frequency variations while maintaining low computational complexity. FI-Conv uses an initial state, PDE parameters, and evolution time as input to predict the system future state. As a representative example of a system exhibiting complex dynamics, we evaluate the performance of FI-Conv on the task of predicting turbulent plasma fields governed by the Hasegawa-Wakatani (HW) equations. The HW system models two-dimensional electrostatic drift-wave turbulence and exhibits strongly nonlinear behavior, making accurate approximation and long-term prediction particularly challenging. Using an autoregressive forecasting procedure, FI-Conv achieves accurate forward prediction of the plasma state evolution over short times (t ~ 3) and captures the statistic properties of derived physical quantities of interest over longer times (t ~ 100). Moreover, we develop a gradient-descent-based inverse estimation method that accurately infers PDE parameters from plasma state evolution data, without modifying the trained model weights. Collectively, our results demonstrate that FI-Conv can be an effective alternative to existing physics-informed machine learning methods for systems with complex spatio-temporal dynamics.
