Exact Learning of Linear Model Predictive Control Laws using Oblique Decision Trees with Linear Predictions

Jiayang Ren; Qiangqiang Mao; Tianwei Zhao; Yankai Cao

Exact Learning of Linear Model Predictive Control Laws using Oblique Decision Trees with Linear Predictions

Jiayang Ren, Qiangqiang Mao, Tianwei Zhao, Yankai Cao

TL;DR

This work introduces an exact-learning framework that maps the Linear MPC control law to oblique decision trees with linear predictions (ODT-LP). It proves CPWA equivalence between Linear MPC and finite-depth ODT-LP, and develops a gradient-based offline training pipeline using grid-generated state-action data with smooth indicator relaxations. The approach yields controllers that match MPC performance while delivering orders-of-magnitude faster online evaluation and enabling verifiable, interpretable control logic. Robustness and ISS analyses show the method remains stable under bounded approximation errors and disturbances, making it suitable for safety-critical applications.

Abstract

Model Predictive Control (MPC) is a powerful strategy for constrained multivariable systems but faces computational challenges in real-time deployment due to its online optimization requirements. While explicit MPC and neural network approximations mitigate this burden, they suffer from scalability issues or lack interpretability, limiting their applicability in safety-critical systems. This work introduces a data-driven framework that directly learns the Linear MPC control law from sampled state-action pairs using Oblique Decision Trees with Linear Predictions (ODT-LP), achieving both computational efficiency and interpretability. By leveraging the piecewise affine structure of Linear MPC, we prove that the Linear MPC control law can be replicated by finite-depth ODT-LP models. We develop a gradient-based training algorithm using smooth approximations of tree routing functions to learn this structure from grid-sampled Linear MPC solutions, enabling end-to-end optimization. Input-to-state stability is established under bounded approximation errors, with explicit error decomposition into learning inaccuracies and sampling errors to inform model design. Numerical experiments demonstrate that ODT-LP controllers match MPC's closed-loop performance while reducing online evaluation time by orders of magnitude compared to MPC, explicit MPC, neural network, and random forest counterparts. The transparent tree structure enables formal verification of control logic, bridging the gap between computational efficiency and certifiable reliability for safety-critical systems.

Exact Learning of Linear Model Predictive Control Laws using Oblique Decision Trees with Linear Predictions

TL;DR

Abstract

Exact Learning of Linear Model Predictive Control Laws using Oblique Decision Trees with Linear Predictions

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (12)