Kernel-Based Optimal Control: An Infinitesimal Generator Approach
Petar Bevanda, Nicolas Hoischen, Tobias Wittmann, Jan Brüdigam, Sandra Hirche, Boris Houska
TL;DR
The paper addresses data-driven optimal control for nonlinear stochastic difusions by learning the infinitesimal generator of controlled diffusions in an RKHS. It develops a Kernel Hamilton-Jacobi-Bellman framework ($KHJB$) that leverages convex reformulations of the ergodic control problem via a PDE-constrained dual, enabling global-ish solutions without explicit parametric models. The methodology hinges on Hilbert-Schmidt regression in infinite dimensions to nonparametrically estimate the adjoints of the generator, producing an explicit finite-dimensional backward value function and feedback policy through a data-driven KHJB recursion. Numerical results on synthetic oscillators and robotic systems demonstrate superior robustness and performance against modern data-driven and classical nonlinear programming methods, highlighting practical impact for scalable, model-free control under uncertainty.
Abstract
This paper presents a novel operator-theoretic approach for optimal control of nonlinear stochastic systems within reproducing kernel Hilbert spaces. Our learning framework leverages data samples of system dynamics and stage cost functions, with only control penalties and constraints provided. The proposed method directly learns the infinitesimal generator of a controlled stochastic diffusion in an infinite-dimensional hypothesis space. We demonstrate that our approach seamlessly integrates with modern convex operator-theoretic Hamilton-Jacobi-Bellman recursions, enabling a data-driven solution to the optimal control problems. Furthermore, our learning framework includes nonparametric estimators for uncontrolled infinitesimal generators as a special case. Numerical experiments, ranging from synthetic differential equations to simulated robotic systems, showcase the advantages of our approach compared to both modern data-driven and classical nonlinear programming methods for optimal control.
