Multi-level Neural Networks for high-dimensional parametric obstacle problems

Martin Eigel; Cosmas Heiß; Janina E. Schütte

TL;DR

This work develops a multi-level CNN surrogate for high-dimensional parametric obstacle problems governed by an elliptic diffusion operator, interpreting the network architecture from a multigrid perspective. It proves expressivity results showing that the CNN can approximate a projected Richardson iteration with parameter counts that grow only polylogarithmically with the target accuracy, and that the network can emulate a multigrid-style V-cycle with monotone restriction. The approach decomposes the FE solution into a coarse-grid part and fine-grid corrections and trains level-specific networks, achieving state-of-the-art accuracy on deterministic, stochastic, and rough-obstacle cases while remaining stable as the parameter dimension grows. In practice, this yields efficient surrogates for repeated parametric solves of variational inequalities, backed by an a priori convergence and complexity analysis and by numerical experiments.
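To make the projected Richardson iteration mentioned above concrete, here is a minimal sketch on a toy problem. It is not the paper's network or experimental setup: it assumes a 1D finite-difference Laplacian as a stand-in for the FE stiffness matrix, a lower obstacle constraint u >= phi, and a hypothetical constant load, obstacle, and step size chosen only for illustration.

```python
def apply_stiffness(u, h):
    """Apply the 1D finite-difference Laplacian (zero Dirichlet boundary) to u."""
    n = len(u)
    out = []
    for i in range(n):
        left = u[i - 1] if i > 0 else 0.0
        right = u[i + 1] if i < n - 1 else 0.0
        out.append((2.0 * u[i] - left - right) / (h * h))
    return out


def projected_richardson(f, phi, h, omega, num_iters):
    """Iterate u <- max(phi, u + omega * (f - A u)) componentwise."""
    u = list(phi)  # the obstacle itself is a feasible starting point
    for _ in range(num_iters):
        Au = apply_stiffness(u, h)
        u = [max(p, ui + omega * (fi - ai))
             for p, ui, fi, ai in zip(phi, u, f, Au)]
    return u


# Illustrative setup (all values are assumptions, not taken from the paper).
n = 15                      # interior grid nodes on (0, 1)
h = 1.0 / (n + 1)
f = [-8.0] * n              # constant downward load
phi = [-0.2] * n            # constant obstacle below the boundary values
omega = h * h / 4.0         # step size below 2 / lambda_max of the stiffness matrix
u = projected_richardson(f, phi, h, omega, num_iters=5000)
```

With these values the unconstrained solution would dip to about -1 in the middle of the domain, so the obstacle is active there and the iterate stays pinned at phi, while near the boundary the discrete equation holds. The step size is chosen so that the unprojected update is a contraction; the componentwise maximum is the projection onto the admissible set.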

Abstract

A new method to solve computationally challenging (random) parametric obstacle problems is developed and analyzed, where the parameters can influence the related partial differential equation (PDE) and determine the position and surface structure of the obstacle. The governing equation is assumed to be a stationary elliptic diffusion problem. The high-dimensional solution of the obstacle problem is approximated by a specifically constructed convolutional neural network (CNN). The architecture, which represents the parameter-to-solution map, is inspired by a finite element constrained multigrid algorithm. This has two benefits. First, it allows for efficient practical computations, since appropriate data preprocessing makes multi-level data available as an explicit output of the NN. This improves the efficacy of the training process and subsequently leads to small errors in the natural energy norm. Second, the comparison of the CNN to a multigrid algorithm provides a means to carry out a complete a priori convergence and complexity analysis of the proposed NN architecture. Numerical experiments illustrate state-of-the-art performance for this challenging problem.
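The multi-level data preprocessing described above can be pictured as splitting a fine-grid function into a coarsest-grid part plus one correction per finer level. The following is a rough sketch under stated assumptions, not the authors' preprocessing: it works on nested 1D grids, uses linear interpolation as the prolongation, and uses injection of a single fine-grid function as a stand-in for computing a solution on each level. The function names `prolong`, `inject`, `decompose`, and `reconstruct` are hypothetical.

```python
import math


def prolong(coarse):
    """Linear interpolation from n interior nodes to 2n + 1 (zero boundary values)."""
    padded = [0.0] + list(coarse) + [0.0]
    fine = []
    for j in range(len(coarse)):
        fine.append(0.5 * (padded[j] + padded[j + 1]))  # new midpoint node
        fine.append(coarse[j])                          # node shared with the coarse grid
    fine.append(0.5 * (padded[-2] + padded[-1]))        # last midpoint node
    return fine


def inject(fine):
    """Keep only the fine-grid values at nodes that also exist on the coarse grid."""
    return fine[1::2]


def decompose(u_fine, num_levels):
    """Return [coarsest part, correction on level 1, ..., correction on finest level]."""
    per_level = [list(u_fine)]
    for _ in range(num_levels - 1):
        per_level.append(inject(per_level[-1]))
    per_level.reverse()  # coarsest level first
    parts = [per_level[0]]
    for coarse, fine in zip(per_level, per_level[1:]):
        interp = prolong(coarse)
        parts.append([a - b for a, b in zip(fine, interp)])
    return parts


def reconstruct(parts):
    """Sum the prolongated coarse part and all corrections back to the finest grid."""
    u = list(parts[0])
    for corr in parts[1:]:
        u = [a + b for a, b in zip(prolong(u), corr)]
    return u


# Illustrative fine-grid function on 15 interior nodes (an assumption for the demo).
n_fine = 15
u_fine = [math.sin(math.pi * (i + 1) / (n_fine + 1)) for i in range(n_fine)]
parts = decompose(u_fine, num_levels=3)
u_back = reconstruct(parts)
```

In this toy setting the corrections shrink from level to level for a smooth function, which is one way to see why level-specific outputs might be easier targets for training than the full fine-grid function at once.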
