Table of Contents
Fetching ...

Conformal Robust Control of Linear Systems

Yash Patel, Sahana Rayan, Ambuj Tewari

TL;DR

The paper addresses robust LQR design under dynamics misspecification by introducing a distribution-free conformal prediction framework to define uncertainty regions with provable coverage. It develops Conformalized Predict-Then-Control (CPC), a policy-gradient-based method that optimizes a robust objective over conformal uncertainty sets and provides convergence guarantees via gradient dominance. Empirical results across diverse engineering systems show CPC achieves lower robust regret and better stability than $\mathcal{H}_{\infty}$ and data-free margin methods, while calibration analyses validate the predicted coverage even with noisy dynamics. The approach enables principled, data-driven robustness in control co-design, with potential extensions to nonlinear settings and end-to-end CCD pipelines, offering practical impact for safe, reliable engineered systems.

Abstract

End-to-end engineering design pipelines, in which designs are evaluated using concurrently defined optimal controllers, are becoming increasingly common in practice. To discover designs that perform well even under the misspecification of system dynamics, such end-to-end pipelines have now begun evaluating designs with a robust control objective in place of the nominal optimal control setup. Current approaches of specifying such robust control subproblems, however, rely on hand specification of perturbations anticipated to be present upon deployment or margin methods that ignore problem structure, resulting in a lack of theoretical guarantees and overly conservative empirical performance. We, instead, propose a novel methodology for LQR systems that leverages conformal prediction to specify such uncertainty regions in a data-driven fashion. Such regions have distribution-free coverage guarantees on the true system dynamics, in turn allowing for a probabilistic characterization of the regret of the resulting robust controller. We then demonstrate that such a controller can be efficiently produced via a novel policy gradient method that has convergence guarantees. We finally demonstrate the superior empirical performance of our method over alternate robust control specifications, such as $H_{\infty}$ and LQR with multiplicative noise, across a collection of engineering control systems.

Conformal Robust Control of Linear Systems

TL;DR

The paper addresses robust LQR design under dynamics misspecification by introducing a distribution-free conformal prediction framework to define uncertainty regions with provable coverage. It develops Conformalized Predict-Then-Control (CPC), a policy-gradient-based method that optimizes a robust objective over conformal uncertainty sets and provides convergence guarantees via gradient dominance. Empirical results across diverse engineering systems show CPC achieves lower robust regret and better stability than and data-free margin methods, while calibration analyses validate the predicted coverage even with noisy dynamics. The approach enables principled, data-driven robustness in control co-design, with potential extensions to nonlinear settings and end-to-end CCD pipelines, offering practical impact for safe, reliable engineered systems.

Abstract

End-to-end engineering design pipelines, in which designs are evaluated using concurrently defined optimal controllers, are becoming increasingly common in practice. To discover designs that perform well even under the misspecification of system dynamics, such end-to-end pipelines have now begun evaluating designs with a robust control objective in place of the nominal optimal control setup. Current approaches of specifying such robust control subproblems, however, rely on hand specification of perturbations anticipated to be present upon deployment or margin methods that ignore problem structure, resulting in a lack of theoretical guarantees and overly conservative empirical performance. We, instead, propose a novel methodology for LQR systems that leverages conformal prediction to specify such uncertainty regions in a data-driven fashion. Such regions have distribution-free coverage guarantees on the true system dynamics, in turn allowing for a probabilistic characterization of the regret of the resulting robust controller. We then demonstrate that such a controller can be efficiently produced via a novel policy gradient method that has convergence guarantees. We finally demonstrate the superior empirical performance of our method over alternate robust control specifications, such as and LQR with multiplicative noise, across a collection of engineering control systems.
Paper Structure (36 sections, 17 theorems, 83 equations, 1 figure, 9 tables, 1 algorithm)

This paper contains 36 sections, 17 theorems, 83 equations, 1 figure, 9 tables, 1 algorithm.

Key Result

Theorem 3.4

Let $J(K, C) := \int_{0}^{\infty} (x(t)^\top (Q + K^\top R K) x(t)) dt$ for $w=0$. Assume that $\mathcal{P}_{\Theta,C}(C\in\mathcal{B}_{\widehat{q}}(f(\Theta))) \ge 1 - \alpha$. Then, under assump:opt_technical_1, where $L$ is the Lipschitz constant of $J(K,\widehat{C})$ in $\widehat{C}\in\mathcal{B}_{\widehat{q}}(f(\theta))$ under the operator norm. Further, if $\widehat{q} < r(C,K^*(C))$, (see

Figures (1)

  • Figure 1: Calibration plots for the tasks, assessed on 1,000 i.i.d. test samples of $C$ with calibration performed using the estimated $\widetilde{C}$, affirming \ref{['thm:amb_ground_truth']}.

Theorems & Definitions (29)

  • Definition 3.3
  • Theorem 3.4: Deterministic, continuous-time
  • Theorem 3.5: Stochastic, continuous-time
  • Theorem 3.6
  • Theorem 3.7
  • Definition A.1
  • Lemma A.2
  • proof
  • Theorem B.1
  • proof
  • ...and 19 more