Learning to Control PDEs with Differentiable Predictive Control and Time-Integrated Neural Operators

Dibakar Roy Sarkar; Ján Drgoňa; Somdatta Goswami

Learning to Control PDEs with Differentiable Predictive Control and Time-Integrated Neural Operators

Dibakar Roy Sarkar, Ján Drgoňa, Somdatta Goswami

TL;DR

The paper addresses PDE-constrained optimal control in infinite-dimensional spaces by replacing online PDE solvers with differentiable neural operator surrogates. It introduces Time-Integrated Deep Operator Networks (TI-DON) to learn the instantaneous temporal derivative $\partial u/\partial t$ and couples them with classical integrators to preserve causal evolution, enabling stable long-horizon predictions. These surrogates are integrated with Differentiable Predictive Control (DPC) to train a parametric neural policy offline via backpropagation through the closed-loop dynamics, eliminating the need for supervisory controllers. Empirical results on the heat, Burgers', and Fisher-KPP equations demonstrate accurate target tracking, shock mitigation, and population-density control, with policies generalizing across initial conditions and problem parameters and transferring to high-fidelity finite-difference solvers. Open-source code supports reproducibility and further research in PDE-constrained, model-based self-supervised control.

Abstract

We present an end-to-end learning to control framework for partial differential equations (PDEs). Our approach integrates Time-Integrated Deep Operator Networks (TI-DeepONets) as differentiable PDE surrogate models within the Differentiable Predictive Control (DPC)-a self-supervised learning framework for constrained neural control policies. The TI-DeepONet architecture learns temporal derivatives and couples them with numerical integrators, thus preserving the temporal causality of infinite-dimensional PDEs while reducing error accumulation in long-horizon predictions. Within DPC, we leverage automatic differentiation to compute policy gradients by backpropagating the expectations of optimal control loss through the learned TI-DeepONet, enabling efficient offline optimization of neural policies without the need for online optimization or supervisory controllers. We empirically demonstrate that the proposed method learns feasible parametric policies across diverse PDE systems, including the heat, the nonlinear Burgers', and the reaction-diffusion equations. The learned policies achieve target tracking, constraint satisfaction, and curvature minimization objectives, while generalizing across distributions of initial conditions and problem parameters. These results highlight the promise of combining operator learning with DPC for scalable, model-based self-supervised learning in PDE-constrained optimal control.

Learning to Control PDEs with Differentiable Predictive Control and Time-Integrated Neural Operators

TL;DR

Abstract

Learning to Control PDEs with Differentiable Predictive Control and Time-Integrated Neural Operators

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)