Jacobian-Scaled K-means Clustering for Physics-Informed Segmentation of Reacting Flows

Shivam Barwey; Venkat Raman

Jacobian-Scaled K-means Clustering for Physics-Informed Segmentation of Reacting Flows

Shivam Barwey, Venkat Raman

TL;DR

This paper introduces Jacobian-scaled K-means (JSK-means), a physics-informed clustering method that replaces the standard Euclidean distance with a centroid-dependent, Jacobian-weighted distance to capture dynamical similarity in thermochemical composition space. By deriving scaling matrices from the chemical Jacobian evaluated at centroids, JSK-means biases cluster centroids toward regions of high chemical sensitivity while preserving the input data. Demonstrations on a high-fidelity hydrogen–air detonation dataset show that JSK-means concentrates clusters near the wavefront and triple-point zones, and POD analyses confirm distinct dynamical separation in composition and source-term spaces. The approach promises improved partition-based modeling in reacting flows and multi-physics problems, with potential extensions to other clustering methods and applications.

Abstract

This work introduces Jacobian-scaled K-means (JSK-means) clustering, which is a physics-informed clustering strategy centered on the K-means framework. The method allows for the injection of underlying physical knowledge into the clustering procedure through a distance function modification: instead of leveraging conventional Euclidean distance vectors, the JSK-means procedure operates on distance vectors scaled by matrices obtained from dynamical system Jacobians evaluated at the cluster centroids. The goal of this work is to show how the JSK-means algorithm -- without modifying the input dataset -- produces clusters that capture regions of dynamical similarity, in that the clusters are redistributed towards high-sensitivity regions in phase space and are described by similarity in the source terms of samples instead of the samples themselves. The algorithm is demonstrated on a complex reacting flow simulation dataset (a channel detonation configuration), where the dynamics in the thermochemical composition space are known through the highly nonlinear and stiff Arrhenius-based chemical source terms. Interpretations of cluster partitions in both physical space and composition space reveal how JSK-means shifts clusters produced by standard K-means towards regions of high chemical sensitivity (e.g., towards regions of peak heat release rate near the detonation reaction zone). The findings presented here illustrate the benefits of utilizing Jacobian-scaled distances in clustering techniques, and the JSK-means method in particular displays promising potential for improving former partition-based modeling strategies in reacting flow (and other multi-physics) applications.

Jacobian-Scaled K-means Clustering for Physics-Informed Segmentation of Reacting Flows

TL;DR

Abstract

Paper Structure (21 sections, 39 equations, 15 figures, 1 table, 2 algorithms)

This paper contains 21 sections, 39 equations, 15 figures, 1 table, 2 algorithms.

Introduction
Dataset
Methodology
Standard K-means Clustering
Jacobian-Scaled K-means Clustering
Jacobian-Scaled K-means Algorithm
Normalization Procedure for Jacobian Regularization
Results
Demonstrative Toy Problem
Detonation Flowfield
Analysis of Clusters in Physical Space via Segmented Fields
Analysis of Clusters in Composition Space via Proper Orthogonal Decomposition
Applications and Need for JSK-Means
JSK-means Use Cases
Distinction from Source Term Clustering
...and 6 more sections

Figures (15)

Figure 1: Computational domain for channel detonation simulation.
Figure 2: Cropping procedure for an instantaneous detonation flowfield at $t=50$$\mu$s used to generate the clustering dataset. Colorbar ranges for $T$, $\rho Y_{H2}$, and $\rho Y_{H2O}$ are $[300,3200]$ K, $[0.001,0.006]$$\text{kg}/\text{m}^3\text{s}$, and $[0.1,0.2]$$\text{kg}/\text{m}^3\text{s}$ respectively.
Figure 3: Singular value distribution for un-normalized (top) and normalized (bottom) chemical Jacobians obtained from dataset $\Phi$. The x-axis denotes the singular value index. For each index, spread in corresponding singular value for all $N$ sample points in $\Phi$ is plotted and colored by temperature. Points correspond to samples above 500 K. Bands are alternately shaded for ease of visualization.
Figure 4: (Left) Plot of source term (black) and Jacobian (blue) versus phase space variable $\phi$ for the 1D toy problem in Eq. \ref{['eq:1d_toy_problem']}. Variables in the plot have been normalized as per the procedure outlined in Sec. \ref{['sec:scaling']}. (Right) Plot of normalized objective functions for standard (solid) and physics-guided (dashed) K-means clustering approaches versus number of iterations of the K-means algorithm. The first 300 iterations is a burn-in phase that utilizes the standard K-means algorithm in Alg. \ref{['alg:standard_kmeans']}. The next 300 iterations utilize the modified physics-guided K-means algorithm in Alg. \ref{['alg:jacobian_kmeans']}.
Figure 5: Cluster visualizations provided by standard K-means (top row) and JSK-means algorithm (bottom row) for $K=5$, $10$, and $15$. Centroid locations are provided as the filled markers in each plot.
...and 10 more figures

Jacobian-Scaled K-means Clustering for Physics-Informed Segmentation of Reacting Flows

TL;DR

Abstract

Jacobian-Scaled K-means Clustering for Physics-Informed Segmentation of Reacting Flows

Authors

TL;DR

Abstract

Table of Contents

Figures (15)