On Minimax Optimal Dual Control for Fully Actuated Systems
Anders Rantzer
TL;DR
The article addresses robust adaptive control under worst-case model uncertainty by formulating a minimax dynamic game. It derives an exact, closed-form solution to the Bellman equation for fully actuated LTI-like systems with A,B uncertainty and unit B up to sign, leading to a dual controller that optimally trades off exploration and exploitation. A key innovation is compressing past data into an augmented state Z, enabling a clean DP over (x,Z) and yielding explicit expressions for the optimal cost via a max over uncertain parameters. The results generalize scalar minimax insights to vector systems and quantify when learning improves performance, with implications for designing adaptive controllers under adversarial disturbances.
Abstract
A multi-variable adaptive controller is derived as the explicit solution to a minimax dynamic game. The minimizing player selects the control action as a function of past state measurements and inputs. The maximizing player selects disturbances and model parameters for the underlying linear time-invariant dynamics. This leads to a Bellman equation that can be solved explicitly for the case with unitary B-matrix known up to a sign and no input penalty. The minimizing policy is a dual controller that optimizes the tradeoff between exploration and exploitation.
