SODAs: Sparse Optimization for the Discovery of Differential and Algebraic Equations
Manu Jayadharan, Christina Catlett, Arthur N. Montanari, Niall M. Mangan
TL;DR
SODAs introduces a principled framework for data-driven discovery of differential-algebraic equations by separating the identification of algebraic constraints from the differential dynamics, thereby preserving physical structure and mitigating multicollinearity. The method iteratively refines a library of candidate terms using sparse regression to uncover algebraic relations, uses singular value decomposition to guide stopping, and then applies conventional ODE discovery to the remaining dynamics on a refined library. Across chemical reaction networks, pendulum systems, and power-grid models, SODAs demonstrates robust recovery of DAEs and reduced coordinates under realistic noise and data limitations, outperforming implicit-SINDy-like approaches that rely on a priori variable identification. The work provides a scalable, interpretable route to model discovery in complex, constrained dynamical systems with practical implications for biology, mechanics, and energy networks.
Abstract
Differential-algebraic equations (DAEs) integrate ordinary differential equations (ODEs) with algebraic constraints, providing a fundamental framework for developing models of dynamical systems characterized by timescale separation, conservation laws, and physical constraints. While sparse optimization has revolutionized model development by allowing data-driven discovery of parsimonious models from a library of possible equations, existing approaches for dynamical systems assume DAEs can be reduced to ODEs by eliminating variables before model discovery. This assumption limits the applicability of such methods to DAE systems with unknown constraints and time scales. We introduce Sparse Optimization for Differential-Algebraic Systems (SODAs), a data-driven method for the identification of DAEs in their explicit form. By discovering the algebraic and dynamic components sequentially without prior identification of the algebraic variables, this approach leads to a sequence of convex optimization problems and has the advantage of discovering interpretable models that preserve the structure of the underlying physical system. To this end, SODAs improves numerical stability when handling high correlations between library terms -- caused by near-perfect algebraic relationships -- by iteratively refining the conditioning of the candidate library. We demonstrate the performance of our method on biological, mechanical, and electrical systems, showcasing its robustness to noise in both simulated time series and real-time experimental data.
