Infinite-Horizon Distributionally Robust Regret-Optimal Control
Taylan Kargin, Joudi Hajar, Vikrant Malik, Babak Hassibi
TL;DR
The paper tackles infinite-horizon distributionally robust regret-optimal control for discrete-time LTI systems under a Wasserstein-2 ambiguity set around disturbances, aiming to minimize the steady-state worst-case expected regret relative to a clairvoyant non-causal policy. It derives a tractable saddle-point formulation and proves strong duality, showing the optimal DR-RO controller is non-rational but admits a finite-dimensional parameterization that can be computed efficiently in the frequency domain using a Frank-Wolfe-type algorithm. A key contribution is a convex, LMIs-based procedure to approximate the non-rational controller with a near-optimal rational $H_inity$-norm state-space controller, enabling practical real-time implementations. The approach yields stability guarantees and robustness to time-correlated disturbances, and its empirical results demonstrate superior performance of the infinite-horizon DR-RO controller over finite-horizon methods and standard $H_2$/$H_\infty$ baselines, with rational approximations closely matching non-rational performance. Overall, the work delivers a scalable, implementable framework for DR regret-optimal control under distributional uncertainty and time correlation, bridging theory and practical controller design.
Abstract
We study the infinite-horizon distributionally robust (DR) control of linear systems with quadratic costs, where disturbances have unknown, possibly time-correlated distribution within a Wasserstein-2 ambiguity set. We aim to minimize the worst-case expected regret-the excess cost of a causal policy compared to a non-causal one with access to future disturbance. Though the optimal policy lacks a finite-order state-space realization (i.e., it is non-rational), it can be characterized by a finite-dimensional parameter. Leveraging this, we develop an efficient frequency-domain algorithm to compute this optimal control policy and present a convex optimization method to construct a near-optimal state-space controller that approximates the optimal non-rational controller in the $\mathit{H}_\infty$-norm. This approach avoids solving a computationally expensive semi-definite program (SDP) that scales with the time horizon in the finite-horizon setting.
