Are Statistical Methods Obsolete in the Era of Deep Learning?
Skyler Wu, Shihao Yang, S. C. Kou
TL;DR
The paper investigates whether statistically principled methods remain relevant in the era of deep learning by comparing PINN, a physics-informed neural network, with MAGI, a Bayesian Gaussian-process-based ODE inference method, on SEIR and Lorenz models under sparse and noisy data. MAGI generally offers robust parameter inference, accurate trajectory reconstruction, and credible uncertainty quantification, particularly when data are limited or components are unobserved, while PINN’s performance is highly sensitive to hyperparameters and can struggle with out-of-sample forecasts in chaotic or partially observed settings. The results advocate for the continued value of statistical approaches in scientific modeling and highlight the potential of integrating probabilistic ODE inference with neural methods to leverage the strengths of both paradigms. Overall, the work emphasizes forecasting reliability, mechanistic fidelity, and uncertainty quantification as key benefits of MAGI, even as deep learning advances continue to transform data-driven modeling.
Abstract
In the era of AI, neural networks have become increasingly popular for modeling, inference, and prediction, largely due to their potential for universal approximation. With the proliferation of such deep learning models, a question arises: are leaner statistical methods still relevant? To shed insight on this question, we employ the mechanistic nonlinear ordinary differential equation (ODE) inverse problem as a testbed, using physics-informed neural network (PINN) as a representative of the deep learning paradigm and manifold-constrained Gaussian process inference (MAGI) as a representative of statistically principled methods. Through case studies involving the SEIR model from epidemiology and the Lorenz model from chaotic dynamics, we demonstrate that statistical methods are far from obsolete, especially when working with sparse and noisy observations. On tasks such as parameter inference and trajectory reconstruction, statistically principled methods consistently achieve lower bias and variance, while using far fewer parameters and requiring less hyperparameter tuning. Statistical methods can also decisively outperform deep learning models on out-of-sample future prediction, where the absence of relevant data often leads overparameterized models astray. Additionally, we find that statistically principled approaches are more robust to accumulation of numerical imprecision and can represent the underlying system more faithful to the true governing ODEs.
