Causality-Informed Data-Driven Predictive Control
Malika Sader, Yibo Wang, Dexian Huang, Chao Shang, Biao Huang
TL;DR
The paper tackles the degradation of data-driven predictive control (DDPC) performance under uncertainty by diagnosing lack of causality as a key source of high prediction variance. It develops a causality-informed DDPC framework using $LQ$ factorization, introducing a causal $oldsymbol{ extgamma}$-DDPC (C-$oldsymbol{ extgamma}$-DDPC) and its regularized variant (RC-$oldsymbol{ extgamma}$-DDPC) that enforce a strictly causal multi-step predictor with minimal additional complexity. By connecting causality to an explicit $LQ$-based representation, the authors derive a simple yet powerful reformulation that reduces non-causal residuals and balances control cost against implicit predictor identification. Numerical studies on stochastic LTI and nonlinear systems, plus a simulated industrial heating furnace, show that enforcing causality improves prediction accuracy and control performance, particularly under noise, model mismatch, or nonlinearity, with competitive computation times relative to noncausal approaches. The work offers a practical, data-driven pathway to more reliable predictive control in industrial applications where uncertainty and nonlinearities are prevalent.
Abstract
As a useful and efficient alternative to generic model-based control scheme, data-driven predictive control is subject to bias-variance trade-off and is known to not perform desirably in face of uncertainty. Through the connection between direct data-driven control and subspace predictive control, we gain insight into the reason being the lack of causality as a main cause for high variance of implicit prediction. In this article, we seek to address this deficiency by devising a novel causality-informed formulation of direct data-driven control. Built upon LQ factorization, an equivalent two-stage reformulation of regularized data-driven control is first derived, which bears clearer interpretability and a lower complexity than generic forms. This paves the way for deriving a two-stage causality-informed formulation of data-driven predictive control, as well as a regularized form that balances between control cost minimization and implicit identification of multi-step predictor. Since it only calls for block-triangularization of a submatrix in LQ factorization, the new causality-informed formulation comes at no excess cost as compared to generic ones. Its efficacy is investigated based on numerical examples and application to model-free control of a simulated industrial heating furnace. Empirical results corroborate that the proposed method yields obvious performance improvement over existing formulations in handling stochastic noise and process nonlinearity.
