Equation-informed data-driven identification of flow budgets and dynamics
Nataliya Sevryugina, Serena Costanzo, Stephen de Bruyn Kops, Colm-cille Caulfield, Iraj Mortazavi, Taraneh Sayadi
TL;DR
The paper introduces Budget Identification Algorithm (BIA), a physics-informed clustering framework that uses pointwise SINDy to extract equation-budget features and then applies Newman modularity clustering to identify dynamically distinct flow regions. It extends to a dynamic Lagrangian formulation (Dynamic-BIA) that tracks cluster evolution as flow structures move, demonstrated on flow around a cylinder and on turbulent stratified flows using TBV and TKE budgets. Results show interpretable clusters corresponding to physically meaningful regimes and indicate TBV provides clearer separation for buoyancy-driven dynamics in stratified turbulence. The work enables interpretable, region-specific model selection and paves the way for hybrid CFD strategies that adapt fidelity by region and time, with open-source tooling potential via PySINDy integration.
Abstract
Computational Fluid Dynamics (CFD) is an indispensable method of fluid modelling in engineering applications, reducing the need for physical prototypes and testing for tasks such as design optimisation and performance analysis. Depending on the complexity of the system under consideration, models ranging from low to high fidelity can be used for prediction, allowing significant speed-up. However, the choice of model requires information about the actual dynamics of the flow regime. Correctly identifying the regions/clusters of flow that share the same dynamics has been a challenging research topic to date. In this study, we propose a novel hybrid approach to flow clustering. It consists of characterising each sample point of the system with equation-based features, i.e. features are budgets that represent the contribution of each term from the original governing equation to the local dynamics at each sample point. This was achieved by applying the Sparse Identification of Nonlinear Dynamical systems (SINDy) method pointwise to time evolution data. The method proceeds with equation-based clustering using the Girvan-Newman algorithm. This allows the detection of communities that share the same physical dynamics. The algorithm is implemented in both Eulerian and Lagrangian frameworks. In the Lagrangian, i.e. dynamic approach, the clustering is performed on the trajectory of each point, allowing the change of clusters to be represented also in time. The performance of the algorithm is first tested on a flow around a cylinder. The construction of the dynamic clusters in this test case clearly shows the evolution of the wake from the steady state solution through the transient to the oscillatory solution. Dynamic clustering was then successfully tested on turbulent flow data. Two distinct and well-defined clusters were identified and their temporal evolution was reconstructed.
