Table of Contents
Fetching ...

DRM Revisited: A Complete Error Analysis

Yuling Jiao, Ruoxuan Li, Peiying Wu, Jerry Zhijian Yang, Pingwen Zhang

TL;DR

This work delivers the first complete end-to-end error analysis for solving elliptic PDEs with the Deep Ritz Method in the over-parameterized regime. It introduces a novel error decomposition that bounds approximation, optimization, and statistical errors simultaneously, and demonstrates that the gradient-descent-based DRM can achieve a target accuracy with explicit, dimension-aware parameter scaling. The key innovation is a tighter optimization error bound that does not require the training dynamics to stay near initialization, making the theory align more closely with practical training. The results provide practitioners with principled guidance on network width, depth, weight bounds, and sampling requirements to guarantee end-to-end convergence with high probability, advancing the theoretical understanding of DRM and over-parameterized neural PDE solvers. This framework paves the way for applying similar complete error analyses to other deep-PDE solvers such as PINNs and related variational formulations.

Abstract

In this work, we address a foundational question in the theoretical analysis of the Deep Ritz Method (DRM) under the over-parameteriztion regime: Given a target precision level, how can one determine the appropriate number of training samples, the key architectural parameters of the neural networks, the step size for the projected gradient descent optimization procedure, and the requisite number of iterations, such that the output of the gradient descent process closely approximates the true solution of the underlying partial differential equation to the specified precision?

DRM Revisited: A Complete Error Analysis

TL;DR

This work delivers the first complete end-to-end error analysis for solving elliptic PDEs with the Deep Ritz Method in the over-parameterized regime. It introduces a novel error decomposition that bounds approximation, optimization, and statistical errors simultaneously, and demonstrates that the gradient-descent-based DRM can achieve a target accuracy with explicit, dimension-aware parameter scaling. The key innovation is a tighter optimization error bound that does not require the training dynamics to stay near initialization, making the theory align more closely with practical training. The results provide practitioners with principled guidance on network width, depth, weight bounds, and sampling requirements to guarantee end-to-end convergence with high probability, advancing the theoretical understanding of DRM and over-parameterized neural PDE solvers. This framework paves the way for applying similar complete error analyses to other deep-PDE solvers such as PINNs and related variational formulations.

Abstract

In this work, we address a foundational question in the theoretical analysis of the Deep Ritz Method (DRM) under the over-parameteriztion regime: Given a target precision level, how can one determine the appropriate number of training samples, the key architectural parameters of the neural networks, the step size for the projected gradient descent optimization procedure, and the requisite number of iterations, such that the output of the gradient descent process closely approximates the true solution of the underlying partial differential equation to the specified precision?
Paper Structure (39 sections, 44 theorems, 283 equations, 1 figure, 1 table)

This paper contains 39 sections, 44 theorems, 283 equations, 1 figure, 1 table.

Key Result

Proposition 2.1

For any $u \in H^1(\Omega)$, it holds that

Figures (1)

  • Figure 1: This figure illustrates the structure of the Parallel Neural Network. The structure within the red box represents the sub-neural networks, which are fully connected networks, where the dark-colored nodes signify activation functions.

Theorems & Definitions (84)

  • Proposition 2.1
  • proof
  • Remark 2.1
  • Theorem 2.1
  • proof
  • Remark 2.2
  • Remark 2.3
  • Theorem 3.1
  • proof
  • Remark 3.1
  • ...and 74 more