Learning to Control: The iUzawa-Net for Nonsmooth Optimal Control of Linear PDEs
Yongcun Song, Xiaoming Yuan, Hangrui Yue, Tianyou Zeng
TL;DR
The paper tackles the challenge of real-time solution of parameterized nonsmooth optimal control problems governed by linear PDEs. It introduces iUzawa-Net, an optimization-informed neural network that unrolls an inexact Uzawa method and replaces PDE solves with learnable surrogates for the state operator and preconditioners, enabling fast inference after a single offline training phase. The authors establish universal approximation properties and asymptotic $\varepsilon$-optimality for the layer outputs, and provide weight-tying results under mild regularity, ensuring robust performance with reduced model complexity. Numerical experiments on elliptic and parabolic problems demonstrate high accuracy with few layers, strong generalization including zero-shot super-resolution, and substantial speedups over classical solvers, highlighting the practical impact of optimization-informed neural networks for PDE-constrained optimization.
Abstract
We propose an optimization-informed deep neural network approach, named iUzawa-Net, aiming for the first solver that enables real-time solutions for a class of nonsmooth optimal control problems of linear partial differential equations (PDEs). The iUzawa-Net unrolls an inexact Uzawa method for saddle point problems, replacing classical preconditioners and PDE solvers with specifically designed learnable neural networks. We prove universal approximation properties and establish the asymptotic $\varepsilon$-optimality for the iUzawa-Net, and validate its promising numerical efficiency through nonsmooth elliptic and parabolic optimal control problems. Our techniques offer a versatile framework for designing and analyzing various optimization-informed deep learning approaches to optimal control and other PDE-constrained optimization problems. The proposed learning-to-control approach synergizes model-based optimization algorithms and data-driven deep learning techniques, inheriting the merits of both methodologies.
