Computing AD-compatible subgradients of convex relaxations of implicit functions

Yingkai Song; Kamil A. Khan

Computing AD-compatible subgradients of convex relaxations of implicit functions

Yingkai Song, Kamil A. Khan

TL;DR

This work introduces the first subgradient propagation rules for implicit-function convex relaxations constructed as optimal-value problems, enabling their integration into forward AD workflows. By distinguishing cases based on the dimensions of the implicit variables $n_x$ and parameters $n_p$, the authors develop closed-form subgradients for $n_x=1$, LP-based directional derivatives for small $n_p$, and lexicographic derivatives for larger $n_p$, all while maintaining compatibility with existing McCormick-style relaxations. The approach leverages residual relaxations, piecewise differentiable relaxations, and Slater-type conditions to derive practical subgradient evaluations, demonstrated through proof-of-concept Julia implementations on thermodynamic and reactor-design problems. The results enable AD-compatible inclusion of implicit-function relaxations in global optimization pipelines, potentially improving lower-bound computations and sensitivity analyses in a wide range of applications. Overall, the paper advances the toolkit for nonsmooth optimization by bridging implicit function relaxations with forward-mode AD and subgradient propagation.

Abstract

Automatic generation of convex relaxations and subgradients is critical in global optimization, and is typically carried out using variants of automatic/algorithmic differentiation (AD). At previous AD conferences, variants of the forward and reverse AD modes were presented to evaluate accurate subgradients for convex relaxations of supplied composite functions. In a recent approach for generating convex relaxations of implicit functions, these relaxations are constructed as optimal-value functions; this formulation is versatile but complicates sensitivity analysis. We present the first subgradient propagation rules for these implicit function relaxations, based on supplied AD-like knowledge of the residual function. Our new subgradient rules allow implicit function relaxations to be added to the elemental function libraries for the forward AD modes for subgradient propagation of convex relaxations. Proof-of-concept numerical results in Julia are presented.

Computing AD-compatible subgradients of convex relaxations of implicit functions

TL;DR

Abstract

Computing AD-compatible subgradients of convex relaxations of implicit functions

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (17)