Data-Driven, ML-assisted Approaches to Problem Well-Posedness
Tom Bertalan, George A. Kevrekidis, Eleni D Koronaki, Siddhartha Mishra, Elizaveta Rebrova, Yannis G. Kevrekidis
TL;DR
The paper addresses how to infer well-posedness of differential equation problems from data, challenging classical a priori assumptions by using a data-driven framework that combines Physics-Informed Neural Networks (PINNs) with manifold learning and randomized linear algebra. It demonstrates, across both ODEs and PDEs, that sampling many randomized solves and analyzing the resulting ensembles with PCA reveals the effective dimensionality of the solution space and the local richness of information provided by data patches. In the ODE harmonic oscillator $x''(t)=-x(t)$, the number and placement of constraints determine whether the solution space is 1D, 2D, or empty, while in the wave equation the discretized system $A\mathbf{u}=\mathbf{b}$ shows well-posed regions with low ensemble variance and underconstrained regions with higher variance; the KS equation further shows how nonlinear PDEs exhibit localized underdetermination in data-poor regions. For overconstrained cases, the authors apply randomized iterative methods (e.g., QuantileSCRK) to drop inconsistent equations and recover meaningful, potentially weak solutions. Overall, the work provides a practical, data-driven pipeline to diagnose, quantify, and potentially rectify well-posedness issues in differential equations, bridging ML-based solvers with mathematical analysis.
Abstract
Classically, to solve differential equation problems, it is necessary to specify sufficient initial and/or boundary conditions so as to allow the existence of a unique solution. Well-posedness of differential equation problems thus involves studying the existence and uniqueness of solutions, and their dependence to such pre-specified conditions. However, in part due to mathematical necessity, these conditions are usually specified "to arbitrary precision" only on (appropriate portions of) the boundary of the space-time domain. This does not mirror how data acquisition is performed in realistic situations, where one may observe entire "patches" of solution data at arbitrary space-time locations; alternatively one might have access to more than one solutions stemming from the same differential operator. In our short work, we demonstrate how standard tools from machine and manifold learning can be used to infer, in a data driven manner, certain well-posedness features of differential equation problems, for initial/boundary condition combinations under which rigorous existence/uniqueness theorems are not known. Our study naturally combines a data assimilation perspective with an operator-learning one.
