Universal Differential Equations for Scientific Machine Learning
Christopher Rackauckas, Yingbo Ma, Julius Martensen, Collin Warner, Kirill Zubov, Rohit Supekar, Dominic Skinner, Ali Ramadhan, Alan Edelman
TL;DR
The paper introduces universal differential equations (UDEs) and the SciML ecosystem as a unified approach to blend physical laws with data-driven models, enabling efficient training, stable adjoint-based gradients, and scalable solutions for high-dimensional and stochastic systems. It demonstrates knowledge-guided symbolic regression, high-dimensional PDE solving via USDEs, and automated closure learning across geophysical and rheological contexts, achieving substantial speedups and data efficiency. Key contributions include a comprehensive adjoint framework, extensive benchmarking against standard ML-DE tools, and practical demonstrations of equation discovery, model reduction, and closure parameterization. Collectively, these advances offer a scalable, physics-informed toolkit for scientific machine learning with broad applicability across engineering, climate science, and materials modeling.
Abstract
In the context of science, the well-known adage "a picture is worth a thousand words" might well be "a model is worth a thousand datasets." In this manuscript we introduce the SciML software ecosystem as a tool for mixing the information of physical laws and scientific models with data-driven machine learning approaches. We describe a mathematical object, which we denote universal differential equations (UDEs), as the unifying framework connecting the ecosystem. We show how a wide variety of applications, from automatically discovering biological mechanisms to solving high-dimensional Hamilton-Jacobi-Bellman equations, can be phrased and efficiently handled through the UDE formalism and its tooling. We demonstrate the generality of the software tooling to handle stochasticity, delays, and implicit constraints. This funnels the wide variety of SciML applications into a core set of training mechanisms which are highly optimized, stabilized for stiff equations, and compatible with distributed parallelism and GPU accelerators.
