Beyond the Kolmogorov Barrier: A Learnable Weighted Hybrid Autoencoder for Model Order Reduction
Nithin Somasekharan, Shaowu Pan
TL;DR
This work introduces a Learnable Weighted Hybrid Autoencoder (LWH-AE) that blends POD/SVD with nonlinear neural encoders/decoders via learnable weights, enabling SVD-like convergence and improved generalization beyond the Kolmogorov barrier. The framework supports both pure reconstruction and time-dependent surrogate modeling by coupling with Koopman forecasting or LSTM dynamics, and demonstrates robust performance on chaotic and multi-scale PDEs including KS and HIT. Across KS, HIT, traveling wave, cylinder flow, Burgers, and shallow-water datasets, LWH-AE consistently outperforms POD, vanilla AE, and simple hybrid baselines, while achieving dramatically lower sharpness and greater noise robustness. The results indicate that high-quality reduced representations are the primary bottleneck in ROMs, and that the proposed approach yields practical gains for surrogate modeling and long-horizon predictions with minimal computational overhead.
Abstract
Representation learning for high-dimensional, complex physical systems aims to identify a low-dimensional intrinsic latent space, which is crucial for reduced-order modeling and modal analysis. To overcome the well-known Kolmogorov barrier, deep autoencoders (AEs) have been introduced in recent years, but they often suffer from poor convergence behavior as the rank of the latent space increases. To address this issue, we propose the learnable weighted hybrid autoencoder, a hybrid approach that combines the strengths of singular value decomposition (SVD) with deep autoencoders through a learnable weighted framework. We find that the introduction of learnable weighting parameters is essential -- without them, the resulting model would either collapse into a standard POD or fail to exhibit the desired convergence behavior. Interestingly, we empirically find that our trained model has a sharpness thousands of times smaller compared to other models. Our experiments on classical chaotic PDE systems, including the 1D Kuramoto-Sivashinsky and forced isotropic turbulence datasets, demonstrate that our approach significantly improves generalization performance compared to several competing methods. Additionally, when combining with time series modeling techniques (e.g., Koopman operator, LSTM), the proposed technique offers significant improvements for surrogate modeling of high-dimensional multi-scale PDE systems.
