Online Nonstochastic Control with Convex Safety Constraints

Nanfei Jiang; Spencer Hutchinson; Mahnoosh Alizadeh

Online Nonstochastic Control with Convex Safety Constraints

Nanfei Jiang, Spencer Hutchinson, Mahnoosh Alizadeh

TL;DR

This work addresses online nonstochastic control for linear time-invariant systems under general convex safety constraints and adversarial time-varying costs. It introduces the Online Gradient Descent with Buffer Zone for Convex Constraints (OGD-BZC), which couples a disturbance-action controller with projected online gradient descent over DAC weights to guarantee safety and achieve sublinear regret. Theoretical results establish deterministic safety under carefully chosen horizon, buffer, and stepsize parameters, along with a $\tilde{O}(\sqrt{T})$ regret bound that is, notably, independent of the number of individual constraints. Numerical experiments on a two-dimensional toy example illustrate robust performance and safety under adversarial disturbances, underscoring the approach’s practicality for safe online control in convex constraint settings.

Abstract

This paper considers the online nonstochastic control problem of a linear time-invariant system under convex state and input constraints that need to be satisfied at all times. We propose an algorithm called Online Gradient Descent with Buffer Zone for Convex Constraints (OGD-BZC), designed to handle scenarios where the system operates within general convex safety constraints. We demonstrate that OGD-BZC, with appropriate parameter selection, satisfies all the safety constraints under bounded adversarial disturbances. Additionally, to evaluate the performance of OGD-BZC, we define the regret with respect to the best safe linear policy in hindsight. We prove that OGD-BZC achieves $\tilde{O} (\sqrt{T})$ regret given proper parameter choices. Our numerical results highlight the efficacy and robustness of the proposed algorithm.

Online Nonstochastic Control with Convex Safety Constraints

TL;DR

Abstract

Online Nonstochastic Control with Convex Safety Constraints

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (35)