Can Transformers overcome the lack of data in the simulation of history-dependent flows?
P. Urdeitx, I. Alfaro, D. Gonzalez, F. Chinesta, E. Cueto
TL;DR
This work investigates whether Transformer architectures can compensate for missing history-dependent variables in complex fluid dynamics by operating in a thermodynamically structured latent space. A three-stage framework first embeds observable states into a GENERIC-based latent manifold trained with a metriplectic integrator, then compares a structure-preserving neural network against a Transformer that evolves latent states in a sequence. Across three benchmarks—flow past a cylinder, Oldroyd-B Couette flow, and FENE polymeric fluid—the Transformer generally outperforms the incomplete SPNN when variables like the conformation tensor are unavailable, particularly in memory-driven, nonlinear regimes, while fully observed, history-free cases favor the SPNN baseline. The study demonstrates that attention mechanisms can implicitly recover historical dependencies, reducing reliance on hard-to-measure internal variables and offering a path toward robust, data-efficient modeling of non-Markovian dynamics.
Abstract
It is well known that the lack of information about certain variables necessary for the description of a dynamical system leads to the introduction of historical dependence (lack of Markovian character of the model) and noise. Traditionally, scientists have made up for these shortcomings by designing phenomenological variables that take into account this historical dependence (typically, conformational tensors in fluids). Often, these phenomenological variables are not easily measurable experimentally. In this work, we study to what extent Transformer architectures are able to cope with the lack of experimental data on these variables. The methodology is evaluated on three benchmark problems: a cylinder flow with no history dependence, a viscoelastic Couette flow modeled via the Oldroyd-B formalism, and a non-linear polymeric fluid described by the FENE model. Our results show that the Transformer outperforms a thermodynamically consistent, structure-preserving neural network with metriplectic bias in systems with missing experimental data, providing lower errors even in low-dimensional latent spaces. In contrast, for systems whose state variables can be fully known, the metriplectic model achieves superior performance.
