A General and Streamlined Differentiable Optimization Framework
Andrew W. Rosemberg, Joaquim Dias Garcia, François Pacaud, Robert B. Parker, Benoît Legat, Kaarthik Sundar, Russell Bent, Pascal Van Hentenryck
TL;DR
This work addresses differentiating through constrained optimization at scale by delivering a general, JuMP-native framework (DiffOpt.jl) that unifies modeling and differentiation. It differentiates the KKT system to provide forward- and reverse-mode sensitivities for smooth, potentially nonconvex problems, supported by a parameter-centric API that exposes derivatives with respect to named problem parameters. Key contributions include extending DiffOpt.jl to nonconvex NLPs, enabling objective sensitivities with respect to explicit parameters, and preserving seamless solver integration via JuMP/MathOptInterface. Demonstrations on energy dispatch, mean–variance portfolio optimization, and nonlinear robot inverse kinematics illustrate practical applicability to learning, calibration, and design within standard JuMP workflows.
Abstract
Differentiating through constrained optimization problems is increasingly central to learning, control, and large-scale decision-making systems, yet practical integration remains challenging due to solver specialization and interface mismatches. This paper presents a general and streamlined framework-an updated DiffOpt.jl-that unifies modeling and differentiation within the Julia optimization stack. The framework computes forward - and reverse-mode solution and objective sensitivities for smooth, potentially nonconvex programs by differentiating the KKT system under standard regularity assumptions. A first-class, JuMP-native parameter-centric API allows users to declare named parameters and obtain derivatives directly with respect to them - even when a parameter appears in multiple constraints and objectives - eliminating brittle bookkeeping from coefficient-level interfaces. We illustrate these capabilities on convex and nonconvex models, including economic dispatch, mean-variance portfolio selection with conic risk constraints, and nonlinear robot inverse kinematics. Two companion studies further demonstrate impact at scale: gradient-based iterative methods for strategic bidding in energy markets and Sobolev-style training of end-to-end optimization proxies using solver-accurate sensitivities. Together, these results demonstrate that differentiable optimization can be deployed as a routine tool for experimentation, learning, calibration, and design-without deviating from standard JuMP modeling practices and while retaining access to a broad ecosystem of solvers.
