SIT-LMPC: Safe Information-Theoretic Learning Model Predictive Control for Iterative Tasks
Zirui Zang, Ahmad Amine, Nick-Marios T. Kokolakis, Truong X. Nghiem, Ugo Rosolia, Rahul Mangharam
TL;DR
SIT-LMPC addresses safe, high-performance control for iterative tasks under uncertainty by extending LMPC to stochastic nonlinear systems and solving constrained optimization via a constrained MPPI with online adaptive penalties. Value function estimation uses normalizing flows to capture rich uncertainty, enabling safer trajectories, while parallel GPU execution yields real-time performance. Across point-mass, simulated autonomous racing, and real 1/5-scale vehicle experiments, SIT-LMPC achieves faster convergence and improved safety compared with LMPC and ABC-LMPC. This approach offers a scalable, data-driven framework for safe iterative learning in robotics with practical impact on real-time control under uncertainty.
Abstract
Robots executing iterative tasks in complex, uncertain environments require control strategies that balance robustness, safety, and high performance. This paper introduces a safe information-theoretic learning model predictive control (SIT-LMPC) algorithm for iterative tasks. Specifically, we design an iterative control framework based on an information-theoretic model predictive control algorithm to address a constrained infinite-horizon optimal control problem for discrete-time nonlinear stochastic systems. An adaptive penalty method is developed to ensure safety while balancing optimality. Trajectories from previous iterations are utilized to learn a value function using normalizing flows, which enables richer uncertainty modeling compared to Gaussian priors. SIT-LMPC is designed for highly parallel execution on graphics processing units, allowing efficient real-time optimization. Benchmark simulations and hardware experiments demonstrate that SIT-LMPC iteratively improves system performance while robustly satisfying system constraints.
