The nature of mathematical models
Andrea De Gaetano
TL;DR
The paper develops a rigorous, Hilbert-space-based foundation for mathematical modeling. A model is treated as an operator on $\mathcal{H}=L^2(\Omega,\mathcal{F},P)$, with a model manifold $\mathcal{M}_{\boldsymbol{X}}$ that lives in $\mathcal{H}$ and a prediction surface $\mathcal{M}_{\boldsymbol{x}}$ that lives in $\mathbb{R}^n$. The paper builds a geometric link between abstract modeling in $\mathcal{H}$ and concrete estimation in case space $\mathbb{R}^n$, using projections for linear models and tangent-space (affine) linearizations for nonlinear models, with Jacobians playing a central role. The work extends classical projection results to affine subspaces and provides a reparameterization approach to define local tangent spaces, connecting conditional estimation with concepts from information geometry and clarifying how the abstract model structure maps to computational procedures. Together, these contributions give a principled framework for model identification and parameter estimation across disciplines, and a precise language for the relationship between theoretical model construction and practical statistical inference.
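To make the two estimation settings named in the summary concrete, the following is a minimal sketch in case space $\mathbb{R}^n$ with $n=5$. It is an illustration under stated assumptions, not the paper's own procedure: the data are synthetic, the helper names (`dot`, `project_onto_two_columns`, `gauss_newton_exp`) are hypothetical, and the exponential-decay model is chosen only for illustration. The linear case is an orthogonal projection of the observation vector onto the span of two design columns. The nonlinear case uses a standard Gauss-Newton iteration, in which each step projects the current residual onto the tangent direction given by the Jacobian of the prediction curve.

```python
import math

def dot(u, v):
    """Euclidean inner product in case space R^n."""
    return sum(a * b for a, b in zip(u, v))

def project_onto_two_columns(c1, c2, y):
    """Orthogonally project y onto span{c1, c2} via the 2x2 normal equations.

    Assumes c1 and c2 are linearly independent (nonzero determinant).
    Returns the coefficients (a, b) and the fitted vector a*c1 + b*c2.
    """
    g11, g12, g22 = dot(c1, c1), dot(c1, c2), dot(c2, c2)
    r1, r2 = dot(c1, y), dot(c2, y)
    det = g11 * g22 - g12 * g12
    a = (g22 * r1 - g12 * r2) / det
    b = (g11 * r2 - g12 * r1) / det
    fitted = [a * u + b * v for u, v in zip(c1, c2)]
    return (a, b), fitted

def gauss_newton_exp(t, y, k0, steps=20):
    """Gauss-Newton for the illustrative one-parameter model y_i = exp(-k * t_i).

    At each step the prediction curve is replaced by its affine tangent line
    at the current k, and the residual is projected onto the Jacobian direction.
    """
    k = k0
    for _ in range(steps):
        pred = [math.exp(-k * ti) for ti in t]
        jac = [-ti * p for ti, p in zip(t, pred)]      # d pred_i / d k
        resid = [yi - pi for yi, pi in zip(y, pred)]
        k += dot(jac, resid) / dot(jac, jac)           # projection coefficient
    return k

# Synthetic sampling times (assumption, not data from the paper).
t = [0.0, 1.0, 2.0, 3.0, 4.0]
ones = [1.0] * len(t)

# Linear case: straight-line model y = a + b * t with made-up observations.
y_linear = [1.1, 2.9, 5.2, 7.1, 8.8]
(a_hat, b_hat), fitted = project_onto_two_columns(ones, t, y_linear)
linear_residual = [yi - fi for yi, fi in zip(y_linear, fitted)]

# Nonlinear case: noise-free data generated with k = 0.5, started from k0 = 0.3.
y_nonlinear = [math.exp(-0.5 * ti) for ti in t]
k_hat = gauss_newton_exp(t, y_nonlinear, k0=0.3)

print("linear coefficients:", a_hat, b_hat)
print("estimated decay rate:", k_hat)
```

In the linear case the residual vector is orthogonal to both design columns, which is the defining property of the projection. In the nonlinear case the same projection is applied repeatedly, each time onto the tangent line at the current parameter value.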
Abstract
Modeling has become a widespread, useful tool in mathematics applied to diverse fields, from physics to economics to biomedicine. Practitioners of modeling may use algebraic or differential equations, whose elements they relate intuitively to some relevant aspect of the reality they wish to represent. More sophisticated expressions may include stochasticity, either as observation error or as system noise. However, a clear, unambiguous mathematical definition of what a model is, and of how the model relates to the real-life phenomena it purports to represent, has so far not been formulated. The present work aims to fill this gap. It motivates the definition of a mathematical model as an operator on a Hilbert space of random variables, identifies the experimental realization as the map between the theoretical space of model construction and the computational space of statistical model identification, and traces the relationship between the geometry of the model manifold in the abstract setting and the corresponding geometry of the prediction surfaces in statistical estimation.
