Table of Contents
Fetching ...

Robustness and Consistency in Linear Quadratic Control with Untrusted Predictions

Tongxin Li, Ruixiao Yang, Guannan Qu, Guanya Shi, Chenkai Yu, Adam Wierman, Steven H. Low

TL;DR

This work addresses learning-augmented linear quadratic control with untrusted predictions by formalizing a robustness-consistency trade-off via competitive ratio bounds. It introduces the $\lambda$-confident control that blends a fully prediction-trusting policy with a prediction-agnostic policy, and derives a bound on the competitive ratio that depends on the trust parameter and prediction error. To avoid exogenously fixing the trust level, the authors propose a self-tuning policy that adaptively updates $\lambda_t$ online, achieving regret $O((\mu_{\mathsf{VAR}}(\mathbf{w})+\mu_{\mathsf{VAR}}(\mathbf{\hat w}))\log T)$ and a competitive ratio of the form $CR(\varepsilon) \le 1 + 2||H|| \varepsilon/(OPT + C\varepsilon) + O(((\mu_{\mathsf{VAR}}(\mathbf{w})+\mu_{\mathsf{VAR}}(\mathbf{\hat w}))\log T)/OPT)$. The approach converges under mild variation assumptions and is validated on robotics tracking, EV charging, and Cart-Pole tasks, showing near-optimal performance across a spectrum of prediction quality. Overall, the work provides a practical framework for safely leveraging black-box AI predictions in control systems with provable guarantees and broad applicability.

Abstract

We study the problem of learning-augmented predictive linear quadratic control. Our goal is to design a controller that balances \textit{"consistency"}, which measures the competitive ratio when predictions are accurate, and \textit{"robustness"}, which bounds the competitive ratio when predictions are inaccurate. We propose a novel $λ$-confident policy and provide a competitive ratio upper bound that depends on a trust parameter $λ\in [0,1]$ set based on the confidence in the predictions and some prediction error $\varepsilon$. Motivated by online learning methods, we design a self-tuning policy that adaptively learns the trust parameter $λ$ with a competitive ratio that depends on $\varepsilon$ and the variation of system perturbations and predictions. We show that its competitive ratio is bounded from above by $ 1+{O(\varepsilon)}/({Θ(1)+Θ(\varepsilon)})+O(μ_{\mathsf{Var}})$ where $μ_\mathsf{Var}$ measures the variation of perturbations and predictions. It implies that when the variations of perturbations and predictions are small, by automatically adjusting the trust parameter online, the self-tuning scheme ensures a competitive ratio that does not scale up with the prediction error $\varepsilon$.

Robustness and Consistency in Linear Quadratic Control with Untrusted Predictions

TL;DR

This work addresses learning-augmented linear quadratic control with untrusted predictions by formalizing a robustness-consistency trade-off via competitive ratio bounds. It introduces the -confident control that blends a fully prediction-trusting policy with a prediction-agnostic policy, and derives a bound on the competitive ratio that depends on the trust parameter and prediction error. To avoid exogenously fixing the trust level, the authors propose a self-tuning policy that adaptively updates online, achieving regret and a competitive ratio of the form . The approach converges under mild variation assumptions and is validated on robotics tracking, EV charging, and Cart-Pole tasks, showing near-optimal performance across a spectrum of prediction quality. Overall, the work provides a practical framework for safely leveraging black-box AI predictions in control systems with provable guarantees and broad applicability.

Abstract

We study the problem of learning-augmented predictive linear quadratic control. Our goal is to design a controller that balances \textit{"consistency"}, which measures the competitive ratio when predictions are accurate, and \textit{"robustness"}, which bounds the competitive ratio when predictions are inaccurate. We propose a novel -confident policy and provide a competitive ratio upper bound that depends on a trust parameter set based on the confidence in the predictions and some prediction error . Motivated by online learning methods, we design a self-tuning policy that adaptively learns the trust parameter with a competitive ratio that depends on and the variation of system perturbations and predictions. We show that its competitive ratio is bounded from above by where measures the variation of perturbations and predictions. It implies that when the variations of perturbations and predictions are small, by automatically adjusting the trust parameter online, the self-tuning scheme ensures a competitive ratio that does not scale up with the prediction error .

Paper Structure

This paper contains 32 sections, 15 theorems, 107 equations, 8 figures, 2 tables, 3 algorithms.

Key Result

Theorem 1

Under our model assumptions, there is a self-tuning online control algorithm that selects some $\lambda_t\in [0,1]$ for all $t=0,\ldots,T-1$ and achieves a competitive ratio as a function of the prediction error $\varepsilon$ where $\mu_\mathsf{Var}$ measures the variation of perturbations and predictions.

Figures (8)

  • Figure 1: System model of linear quadratic control (LQC) with untrusted predictions.
  • Figure 2: Tracking trajectories and trust parameters $(\lambda_0,\ldots,\lambda_{T-1})$ of the self-tuning control scheme. The x-axis and y-axis in the top $6$ figures are locations of the robot. The y-axis in the bottom $3$ figures denotes the value of the trust parameter.
  • Figure 3: Impact of trust parameters and performance of self-tuning control for robot tracking.
  • Figure 4: Adaptive battery-buffered EV charging modelled as a linear quadratic control problem.
  • Figure 5: An example of the daily charging demands in ACN-Data lee2019acn on Nov 1st, $2018$.
  • ...and 3 more figures

Theorems & Definitions (24)

  • Theorem : Informal
  • Definition 1.1
  • Definition 1.2
  • Theorem 1.1: Theorem 3.2 in yu2020power
  • Theorem 2.1
  • Theorem 2.2
  • Lemma 1
  • Lemma 2
  • Theorem 3.1
  • proof : Proof of Theorem \ref{['thm:competitive_self_tuning']}
  • ...and 14 more