Sparsifying dimensionality reduction of PDE solution data with Bregman learning
Tjeerd Jan Heeringa, Christoph Brune, Mengwu Guo
TL;DR
The paper tackles the challenge of compressing PDE solution data with nonlinear projections while controlling model size. It introduces a multistep sparsification pipeline that trains autoencoders using linearized Bregman iterations with sparsity- and low-rank-inducing regularizers, followed by latent-SVD truncation and bias-propagation post-processing. Across 1D diffusion, 1D advection, and 2D reaction-diffusion, the approach achieves accuracy comparable to standard optimizers while reducing parameters by about 30% and shrinking the latent space by roughly 60%, with AdaBreg often delivering the best sparsity-accuracy trade-off. This yields practical, efficient reduced-order models for PDE data and provides a framework for principled latent-dimension control via sparsity and post-processing.
Abstract
Classical model reduction techniques project the governing equations onto a linear subspace of the original state space. More recent data-driven techniques use neural networks to enable nonlinear projections. Whilst those often enable stronger compression, they may have redundant parameters and lead to suboptimal latent dimensionality. To overcome these, we propose a multistep algorithm that induces sparsity in the encoder-decoder networks for effective reduction in the number of parameters and additional compression of the latent space. This algorithm starts with sparsely initialized a network and training it using linearized Bregman iterations. These iterations have been very successful in computer vision and compressed sensing tasks, but have not yet been used for reduced-order modelling. After the training, we further compress the latent space dimensionality by using a form of proper orthogonal decomposition. Last, we use a bias propagation technique to change the induced sparsity into an effective reduction of parameters. We apply this algorithm to three representative PDE models: 1D diffusion, 1D advection, and 2D reaction-diffusion. Compared to conventional training methods like Adam, the proposed method achieves similar accuracy with 30% less parameters and a significantly smaller latent space.
