Shallow Recurrent Decoder for Reduced Order Modeling of Plasma Dynamics
J. Nathan Kutz, Maryam Reza, Farbod Faraji, Aaron Knoll
TL;DR
The paper tackles the computational bottleneck of high-fidelity ${\bf E}\times {\bf B}$ plasma simulations by introducing the Shallow REcurrent Decoder (SHRED), a reduced-order modeling framework that encodes time-series sensor data with an LSTM and decodes it to the full high-dimensional state via a shallow network. Grounded in the separation-of-variables principle, SHRED can be trained on a compressed representation of the data, built from the low-rank factor ${\bf V}^{(k)}$ of a randomized SVD. This enables reconstruction of 14 coupled plasma fields from as few as three sensors and provides neural-network roll-outs for forecasting. The approach is demonstrated on a Hall-thruster–representative 2D radial-azimuthal problem, where it achieves accurate reconstructions and credible forecasts while drastically reducing training and inference costs (for example, the randomized SVD shortens preprocessing time). These results suggest SHRED as a viable path toward fast, data-driven digital twins for plasma devices, with potential integration of measurements and simulations to refine physical models. Notation: the full high-dimensional state is ${\bf x}_t\in\mathbb{R}^n$, the sensor measurements are ${\bf y}_t= {\bf C}{\bf x}_t$, and the rank-$k$ representation ${\bf X}^{(k)}={\bf U}^{(k)}{\boldsymbol\Sigma}^{(k)}{{\bf V}^{(k)}}^*$ is used to train a compact SHRED model.
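To make the notation concrete, the following is a minimal NumPy sketch of the two ingredients named above: a rank-$k$ randomized SVD of a snapshot matrix and point-sensor measurements ${\bf y}_t={\bf C}{\bf x}_t$. It is an illustration under stated assumptions, not the authors' code. The function `randomized_svd`, the synthetic data, the sizes `n`, `m`, `k`, and the sensor locations are all hypothetical, and the range-finder with power iterations is one common variant of randomized SVD.

```python
import numpy as np

def randomized_svd(X, k, oversample=10, n_iter=2, seed=0):
    """Rank-k randomized SVD of X (n x m): returns U (n x k), s (k,), Vt (k x m)."""
    rng = np.random.default_rng(seed)
    n, m = X.shape
    ell = min(k + oversample, n, m)  # sketch width with a little oversampling
    # Random projection captures the dominant column space of X.
    Q, _ = np.linalg.qr(X @ rng.standard_normal((m, ell)))
    # A few power iterations sharpen the subspace when singular values decay slowly.
    for _ in range(n_iter):
        Q, _ = np.linalg.qr(X.T @ Q)
        Q, _ = np.linalg.qr(X @ Q)
    # Exact SVD of the small projected matrix, then lift back to n dimensions.
    U_small, s, Vt = np.linalg.svd(Q.T @ X, full_matrices=False)
    return (Q @ U_small)[:, :k], s[:k], Vt[:k, :]

# Synthetic snapshot matrix (assumption): n spatial points, m time steps, exact rank k.
rng = np.random.default_rng(1)
n, m, k = 400, 120, 5
X = rng.standard_normal((n, k)) @ rng.standard_normal((k, m))

U, s, Vt = randomized_svd(X, k)
X_k = U @ np.diag(s) @ Vt  # X^(k) = U^(k) Sigma^(k) V^(k)*
rel_err = np.linalg.norm(X - X_k) / np.linalg.norm(X)

# Three point sensors: C selects three rows of the state (hypothetical locations).
sensor_idx = np.array([10, 200, 350])
C = np.zeros((3, n))
C[np.arange(3), sensor_idx] = 1.0
Y = C @ X  # column t of Y is y_t = C x_t

print("relative rank-k reconstruction error:", rel_err)
print("sensor time series shape:", Y.shape)
```

In this sketch the rows of `Vt` hold the $k$ temporal coefficients at each time step, so a model trained on them works with $k$ numbers per snapshot instead of $n$.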
Abstract
Reduced order models are becoming increasingly important for rendering complex and multiscale spatio-temporal dynamics computationally tractable. The computational efficiency of such surrogate models is especially important for design, exhaustive exploration and physical understanding. Plasma simulations, in particular those applied to the study of ${\bf E}\times {\bf B}$ plasma discharges and technologies such as Hall thrusters, require substantial computational resources to resolve multidimensional dynamics that span wide spatial and temporal scales. Although high-fidelity computational tools are available to simulate such systems over limited conditions and in highly simplified geometries, simulations of full-size systems and/or extensive parametric studies over many geometric configurations and under different physical conditions are computationally intractable with conventional numerical tools. Thus, scientific studies and industrially oriented modeling of plasma systems, including the important ${\bf E}\times {\bf B}$ technologies, stand to benefit significantly from reduced order modeling algorithms. We develop a model reduction scheme based upon a {\em Shallow REcurrent Decoder} (SHRED) architecture. The scheme uses a neural network to encode limited sensor measurements in time (sequence-to-sequence encoding) and a decoder network to map that encoding to full state-space reconstructions. Based upon the theory of separation of variables, the SHRED architecture is capable of (i) reconstructing full spatio-temporal fields with as few as three point sensors, including fields that are not measured by the sensors but are dynamically coupled to the measured field, and (ii) forecasting the future state of the system using neural network roll-outs from the trained time encoding model.
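The encode-then-decode data flow described in the abstract can be sketched as a single forward pass in NumPy. This is an untrained illustration with random weights, meant only to show the shapes involved. The layer sizes, the number of lags, the ReLU decoder, the helper names `lstm_encode` and `shallow_decode`, and the orthonormal stand-in basis `Phi` are all assumptions rather than details taken from the paper.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_encode(seq, W, U, b):
    """Run a single-layer LSTM over seq (lags x n_sensors); return the final hidden state."""
    h_dim = U.shape[1]
    h = np.zeros(h_dim)
    c = np.zeros(h_dim)
    for y_t in seq:
        z = W @ y_t + U @ h + b
        i, f, g, o = np.split(z, 4)  # input, forget, candidate, output gates
        c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
        h = sigmoid(o) * np.tanh(c)
    return h

def shallow_decode(h, W1, b1, W2, b2):
    """Shallow decoder: one hidden ReLU layer followed by a linear output layer."""
    return W2 @ np.maximum(W1 @ h + b1, 0.0) + b2

# Hypothetical sizes: 3 sensors, 20 time lags, k latent coefficients, n state entries.
rng = np.random.default_rng(0)
n_sensors, lags, hidden, width, k, n = 3, 20, 16, 32, 8, 500

# Random (untrained) weights, for shape illustration only.
W = 0.1 * rng.standard_normal((4 * hidden, n_sensors))
U = 0.1 * rng.standard_normal((4 * hidden, hidden))
b = np.zeros(4 * hidden)
W1 = 0.1 * rng.standard_normal((width, hidden))
b1 = np.zeros(width)
W2 = 0.1 * rng.standard_normal((k, width))
b2 = np.zeros(k)

# Stand-in spatial basis with orthonormal columns (in practice, from a low-rank SVD).
Phi, _ = np.linalg.qr(rng.standard_normal((n, k)))

sensor_history = rng.standard_normal((lags, n_sensors))  # trajectory of y_t over the lags
h_final = lstm_encode(sensor_history, W, U, b)           # temporal encoding
coeffs = shallow_decode(h_final, W1, b1, W2, b2)         # k latent coefficients
x_hat = Phi @ coeffs                                     # lifted full-state estimate

print(h_final.shape, coeffs.shape, x_hat.shape)
```

Decoding to $k$ coefficients and lifting with a fixed basis mirrors the separation-of-variables idea: the network handles the time dependence, while the spatial structure is carried by the basis.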
