Fundamentals of Regression
Miguel A. Mendez
TL;DR
The chapter surveys regression as learning a stochastic mapping from inputs to outputs, rooted in maximum likelihood and uncertainty quantification. It surveys parametric and non-parametric approaches, including linear bases, ANNs, kernel methods, and symbolic regression, and shows how these methods connect to physics via physics-informed cost functions and PINNs. It highlights bootstrapping, cross-validation, and robust loss functions as tools for generalization and resilience to outliers, and it outlines strategies to couple data-driven models with PDE-based physics (e.g., FEM, PINNs) to yield physics-consistent predictions. The work provides a framework for hybridizing machine learning with numerical methods for physics, enabling robust uncertainty quantification and improved physical fidelity across scientific computing tasks.
Abstract
This chapter opens with a review of classic tools for regression, a subset of machine learning that seeks to find relationships between variables. With the advent of scientific machine learning this field has moved from a purely data-driven (statistical) formalism to a constrained or ``physics-informed'' formalism, which integrates physical knowledge and methods from traditional computational engineering. In the first part, we introduce the general concepts and the statistical flavor of regression versus other forms of curve fitting. We then move to an overview of traditional methods from machine learning and their classification and ways to link these to traditional computational science. Finally, we close with a note on methods to combine machine learning and numerical methods for physics
