Multilevel CNNs for Parametric PDEs based on Adaptive Finite Elements
Janina Enrica Schütte, Martin Eigel
TL;DR
The paper develops a multilevel CNN framework that emulates an adaptive finite element method (AFEM) for parametric PDEs, using adaptively refined FE data and a posteriori error estimators as inputs. By representing FE corrections, estimators, and refinement masks as images, it leverages submanifold sparse CNNs to perform level-wise multigrid-like updates with local corrections, achieving provable approximation guarantees of the AFEM steps. The authors derive quantitative complexity bounds showing CNNs can approximate the multigrid solver, estimator, and the full AFEM with logarithmic dependence on the inverse accuracy and linear dependence on the number of refinement levels and iterations. Numerical experiments on a cookie-type parametric diffusion problem demonstrate the approach conceptually, with CNNs closely tracking adaptive solver performance and estimator-driven marking. The work advances efficient, data-driven surrogates for high-dimensional parametric PDEs by integrating adaptive discretization, error estimation, and localized neural computations.
Abstract
A neural network architecture is presented that exploits the multilevel properties of high-dimensional parameter-dependent partial differential equations, enabling an efficient approximation of parameter-to-solution maps, rivaling best-in-class methods such as low-rank tensor regression in terms of accuracy and complexity. The neural network is trained with data on adaptively refined finite element meshes, thus reducing data complexity significantly. Error control is achieved by using a reliable finite element a posteriori error estimator, which is also provided as input to the neural network. The proposed U-Net architecture with CNN layers mimics a classical finite element multigrid algorithm. It can be shown that the CNN efficiently approximates all operations required by the solver, including the evaluation of the residual-based error estimator. In the CNN, a culling mask set-up according to the local corrections due to refinement on each mesh level reduces the overall complexity, allowing the network optimization with localized fine-scale finite element data. A complete convergence and complexity analysis is carried out for the adaptive multilevel scheme, which differs in several aspects from previous non-adaptive multilevel CNN. Moreover, numerical experiments with common benchmark problems from Uncertainty Quantification illustrate the practical performance of the architecture.
