Clarifying the effect of mean subtraction on Dynamic Mode Decomposition
Gowtham S Seenivasaharagavan, Milan Korda, Hassan Arbabi, Igor Mezić
TL;DR
The paper analyzes how subtracting the temporal mean in Dynamic Mode Decomposition affects the resulting spectral estimates when the observables admit only a finite number of Koopman modes. It establishes a geometric necessary-and-sufficient condition for muDMD to be equivalent to a temporal DFT and shows that linear consistency plus over-sampling prevents this equivalence, while under- or just-sampling can preserve it. Delay embedding is proposed as a practical tool to attain a Koopman-invariant dictionary, thereby severing muDMD–DFT equivalence in data-rich regimes. The work combines rigorous theory with numerical experiments on LTIs, the Van der Pol oscillator, and lid-driven cavity flow, and introduces the Relative distance to DFT as a numerically robust indicator of equivalence and data sufficiency. Collectively, the results provide a principled way to diagnose data adequacy for DMD and to design preprocessing or embedding strategies that avoid misleading DMD-DFT identifications.
Abstract
Any autonomous nonlinear dynamical system can be viewed as a superposition of infinitely many linear processes, through the so-called Koopman mode decomposition. Its data-driven approximation- Dynamic Mode Decomposition (DMD)- has been extensively developed and deployed across a plethora of fields. In this work, we study the effect of subtracting the temporal mean on the DMD approximation, for observables possessing only a finite number of Koopman modes. Pre-processing time-sequential training data by removing the temporal mean has been a point of contention in the Companion matrix formulation of DMD. This stems from the potential of said pre-processing to render DMD equivalent to a temporal Discrete Fourier Transform (DFT). We prove that this equivalence is impossible when the training data is linearly consistent and the order of the DMD model exceeds the number of Koopman modes. Since model order and training set size are synonymous in this variant of DMD, the parity of DMD and DFT can, therefore, be indicative of inadequate training data.
