Differential estimates for fast first-order multilevel nonconvex optimisation
Neil Dizon, Tuomo Valkonen
TL;DR
This work develops a general single-loop framework for differential estimates in bilevel and multilevel optimization, enabling outer updates with inexact gradients computed from flexible inner and adjoint solvers. Central to the theory are tracking inequalities that govern inner/adjoint progress and a differential transformation bound, which together yield Lipschitz-type and descent properties for the approximate outer differential $\widetilde{F'}$. The authors extend convergence analysis to nonconvex forward-backward and primal-dual proximal splitting schemes under operator-relative regularity, proving subdifferential convergence, quasi-Féjer monotonicity, and weak/ergodic convergence in normed spaces. The framework accommodates mismatched adjoints, linear-system/PDE inner problems, and measure-space settings, and it provides constructive guidance for implementing efficient single-loop bilevel solvers with guaranteed stability and convergence.
Abstract
With a view on bilevel and PDE-constrained optimisation, we develop iterative estimates $\widetilde{F'}(x^k)$ of $F'(x^k)$ for composite functions $F :=J \circ S$, where $S$ is the solution mapping of the inner optimisation problem or PDE. The idea is to form a single-loop method by interweaving updates of the iterate $x^k$ by an outer optimisation method, with updates of the estimate by single steps of standard optimisation methods and linear system solvers. When the inner methods satisfy simple tracking inequalities, the differential estimates can almost directly be employed in standard convergence proofs for general forward-backward type methods. We adapt those proofs to a general inexact setting in normed spaces, that, besides our differential estimates, also covers mismatched adjoints and unreachable optimality conditions in measure spaces. As a side product of these efforts, we provide improved convergence results for nonconvex Primal-Dual Proximal Splitting (PDPS).
