Constrained Performance Boosting Control for Nonlinear Systems via ADMM
Gianluca Giacomelli, Danilo Saccani, Siep Weiland, Giancarlo Ferrari-Trecate, Valentina Breschi
TL;DR
This work tackles boosting nonlinear system performance under hard input and state constraints while preserving stability. It extends the prior Performance Boosting (PB) approach by embedding it in an ADMM framework (ADMM-PB) to handle constraints without altering the original neural controller architecture, leveraging an IMC-based stability guarantee. The method introduces copy variables, an augmented Lagrangian, and alternating optimization over a learnable operator $\mathbfcal{M}$ parameterized by $\theta$, together with projection steps onto convex constraint sets. Numerical results on a constrained point-mass robot benchmark show that ADMM-PB yields smoother training and better constraint satisfaction than a barrier-based baseline, with a trade-off toward more conservative trajectories at higher constraint emphasis. This approach provides a principled, scalable route to performance enhancement for nonlinear systems where constraint satisfaction is critical in safety-critical applications.
Abstract
We present the Alternating Direction Method of Multipliers for Performance Boosting (ADMM-PB), an approach to design performance boosting controllers for stable or pre-stabilized nonlinear systems, while explicitly seeking input and state constraint satisfaction. Rooted on a recently proposed approach for designing neural-network controllers that guarantees closed-loop stability by design while minimizing generic cost functions, our strategy integrates it within an alternating direction method of multipliers routine to seek constraint handling without modifying the controller structure of the aforementioned seminal strategy. Our numerical results showcase the advantages of the proposed approach over a baseline penalizing constraint violation through barrier-like terms in the cost, indicating that ADMM-PB can lead to considerably lower constraint violations at the price of inducing slightly more cautious closed-loop behaviors.
