The Neural Network Approach to Inverse Problems in Differential Equations
Kailai Xu, Eric Darve
TL;DR
This work develops a neural-network-based framework to solve inverse problems in differential equations by learning unknown functions inside PDEs with automatic differentiation and deferred discretization. It performs a rigorous error analysis that decomposes total error into observation noise, discretization (consistency), and optimization components, deriving a bound for diffusion problems that links |f_theta(x) - f(x)| to Δt, h, ε_opt, and ε_o. The methodology enables end-to-end calibration by evaluating neural networks within forward solvers and using a data-driven loss, while also providing a sensitivity analysis to quantify how small changes in network parameters affect quantities of interest. Numerical experiments on diffusion, wave, and Burgers-type equations demonstrate second-order convergence with data refinement, robustness to noise, and the ability to recover complex, spatially varying coefficients. The framework offers a scalable, practical approach to inverse PDE problems with potential extensions to tailored AD tools, advanced architectures, and uncertainty quantification through sensitivity metrics.
Abstract
We proposed a framework for solving inverse problems in differential equations based on neural networks and automatic differentiation. Neural networks are used to approximate hidden fields. We analyze the source of errors in the framework and derive an error estimate for a model diffusion equation problem. Besides, we propose a way for sensitivity analysis, utilizing the automatic differentiation mechanism embedded in the framework. It frees people from the tedious and error-prone process of deriving the gradients. Numerical examples exhibit consistency with the convergence analysis and error saturation is noteworthily predicted. We also demonstrate the unique benefits neural networks offer at the same time: universal approximation ability, regularizing the solution, bypassing the curse of dimensionality and leveraging efficient computing frameworks.
