PAC-Bayesian Optimal Control with Stability and Generalization Guarantees
Mahrokh Ghoddousi Boroujeni, Clara Lucía Galimberti, Andreas Krause, Giancarlo Ferrari-Trecate
TL;DR
This work addresses generalization in stochastic nonlinear optimal control by extending PAC-Bayes theory to SNOC and providing a randomized, stability-guaranteed framework. It introduces a Gibbs posterior over stabilizing controller parameters, coupled with tractable approximations (SVGD, normalizing flows) and a two-stage inference scheme to tighten bounds. The approach yields a principled balance between empirical performance and prior knowledge, ensuring closed-loop stability via expressive neural controllers (REN/SSM) and data-driven priors. Empirical results on an LTI system and cooperative robotics tasks demonstrate improved generalization over empirical SNOC and highlight scalability and practical performance gains.
Abstract
Stochastic Nonlinear Optimal Control (SNOC) seeks to minimize a cost function that accounts for random disturbances acting on a nonlinear dynamical system. Since the expectation over all disturbances is generally intractable, a common surrogate is the empirical cost, obtained by averaging over a finite dataset of sampled noise realizations. This substitution, however, introduces the challenge of guaranteeing performance under unseen disturbances. The issue is particularly severe when the dataset is limited, as the trained controllers may overfit, leading to substantial gaps between their empirical cost and the deployment cost. In this work, we develop a PAC-Bayesian framework that establishes rigorous generalization bounds for SNOC. Building on these bounds, we propose a principled controller design method that balances empirical performance and prior knowledge. To ensure tractability, we derive computationally efficient relaxations of the bounds and employ approximate inference methods. Our framework further leverages expressive neural controller parameterizations, guaranteeing closed-loop stability. Through simulated examples, we highlight how prior knowledge can be incorporated into control design and how more reliable controllers can be synthesized for cooperative robotics.
