Lyapunov-based reinforcement learning for distributed control with stability guarantee

Jingshi Yao; Minghao Han; Xunyuan Yin

Lyapunov-based reinforcement learning for distributed control with stability guarantee

Jingshi Yao, Minghao Han, Xunyuan Yin

TL;DR

This work tackles stability-guaranteed distributed control of stochastic nonlinear systems by marrying Lyapunov theory with model-free reinforcement learning. It introduces the Distributed Lyapunov Actor-Critic (DLAC) framework, where each subsystem maintains a local critic and a Gaussian policy, guided by Lyapunov functions L^i to ensure mean-cost stability across the interconnected MDP. Theoretical conditions establish a sufficient Lyapunov-based criterion for stability, while the training procedure uses distributed critic training and collaborative but minimal-information-exchange actor updates, yielding decentralized execution. Empirical results on a three-tank chemical process show convergence, robustness to disturbances, and effective tracking of multiple references, with DLAC outperforming open-loop control and competing with NMPC without requiring first-principles models. The approach offers a scalable, data-driven avenue for stable distributed control in complex industrial systems, with potential extensions to sequential communications and POMDP settings.

Abstract

In this paper, we propose a Lyapunov-based reinforcement learning method for distributed control of nonlinear systems comprising interacting subsystems with guaranteed closed-loop stability. Specifically, we conduct a detailed stability analysis and derive sufficient conditions that ensure closed-loop stability under a model-free distributed control scheme based on the Lyapunov theorem. The Lyapunov-based conditions are leveraged to guide the design of local reinforcement learning control policies for each subsystem. The local controllers only exchange scalar-valued information during the training phase, yet they do not need to communicate once the training is completed and the controllers are implemented online. The effectiveness and performance of the proposed method are evaluated using a benchmark chemical process that contains two reactors and one separator.

Lyapunov-based reinforcement learning for distributed control with stability guarantee

TL;DR

Abstract

Lyapunov-based reinforcement learning for distributed control with stability guarantee

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (6)