Solving Functional Optimization with Deep Networks and Variational Principles
Kawisorn Kamtue, Jose M. F. Moura, Orathai Sangpetch
TL;DR
This work tackles functional optimization with moving boundaries, where terminal time and state are unknown, by embedding the calculus of variations into a neural network framework. CalVNet learns state, control, and costate trajectories (and possibly the terminal time) without ground-truth data by minimizing $\Psi=\sum_i \|\psi_i\|^2$ to enforce variational optimality, i.e., $\delta\mathcal{J}=0$. The authors validate on three canonical problems: reproducing the Kalman filter in a variational setting, deriving bang-bang minimum-time control, and computing geodesics on manifolds, obtaining results that match analytical solutions and demonstrating robustness beyond fixed horizons. This principled, unsupervised approach offers a data-efficient pathway to solve complex functional optimization problems where traditional methods struggle due to moving boundaries or unknown supports.
Abstract
Can neural networks solve math problems using first a principle alone? This paper shows how to leverage the fundamental theorem of the calculus of variations to design deep neural networks to solve functional optimization without requiring training data (e.g., ground-truth optimal solutions). Our approach is particularly crucial when the solution is a function defined over an unknown interval or support\textemdash such as in minimum-time control problems. By incorporating the necessary conditions satisfied by the optimal function solution, as derived from the calculus of variation, in the design of the deep architecture, CalVNet leverages overparameterized neural networks to learn these optimal functions directly. We validate CalVNet by showing that, without relying on ground-truth data and simply incorporating first principles, it successfully derives the Kalman filter for linear filtering, the bang-bang optimal control for minimum-time problems, and finds geodesics on manifolds. Our results demonstrate that CalVNet can be trained in an unsupervised manner, without relying on ground-truth data, establishing a promising framework for addressing general, potentially unsolved functional optimization problems that still lack analytical solutions.
