Wasserstein Distributionally Robust Regret-Optimal Control in the Infinite-Horizon
Taylan Kargin, Joudi Hajar, Vikrant Malik, Babak Hassibi
TL;DR
The paper develops distributionally robust regret-optimal control for discrete-time LTI systems over an infinite horizon, using a Wasserstein-2 ambiguity set to hedge against distributional uncertainty in time-correlated disturbances. By applying a strong duality framework, it reduces the problem to a suboptimal, gamma-parameterized formulation and derives a frequency-domain fixed-point method to synthesize the controller, despite the optimal policy being non-rational. The gamma-suboptimal controller is characterized by a finite-dimensional parameter and a spectral-factor based relation, enabling efficient computation via a fixed-point iterations and spectral factorization, while guaranteeing stabilization for $eta>eta_{ m RO}$. The approach interpolates between $H_2$ and regret-optimal (RO) performance as the ambiguity radius changes, and numerical experiments on aircraft-like systems demonstrate superior worst-case regret performance with favorable computation times compared to horizon-dependent SDPs. This work advances robust control by accommodating arbitrarily correlated disturbances and providing scalable infinite-horizon synthesis methods with practical applicability.
Abstract
We investigate the Distributionally Robust Regret-Optimal (DR-RO) control of discrete-time linear dynamical systems with quadratic cost over an infinite horizon. Regret is the difference in cost obtained by a causal controller and a clairvoyant controller with access to future disturbances. We focus on the infinite-horizon framework, which results in stability guarantees. In this DR setting, the probability distribution of the disturbances resides within a Wasserstein-2 ambiguity set centered at a specified nominal distribution. Our objective is to identify a control policy that minimizes the worst-case expected regret over an infinite horizon, considering all potential disturbance distributions within the ambiguity set. In contrast to prior works, which assume time-independent disturbances, we relax this constraint to allow for time-correlated disturbances, thus actual distributional robustness. While we show that the resulting optimal controller is non-rational and lacks a finite-dimensional state-space realization, we demonstrate that it can still be uniquely characterized by a finite dimensional parameter. Exploiting this fact, we introduce an efficient numerical method to compute the controller in the frequency domain using fixed-point iterations. This method circumvents the computational bottleneck associated with the finite-horizon problem, where the semi-definite programming (SDP) solution dimension scales with the time horizon. Numerical experiments demonstrate the effectiveness and performance of our framework.
