Optimal Control of Multi-Agent Systems with Processing Delays

Mruganka Kashyap; Laurent Lessard

Optimal Control of Multi-Agent Systems with Processing Delays

Mruganka Kashyap, Laurent Lessard

TL;DR

The paper tackles optimal control of a heterogeneous network of continuous-time LTI agents connected by a fixed directed graph with processing delays. It develops a convex Youla-parameterization-based framework that handles delay through a loop-shifting transformation and yields explicit, closed-form solutions for the $\mathcal{H}_2$-optimal decentralized controller, along with its cost, under $LQG$ assumptions. The results include both delay-free and delayed cases, with an agent-level implementation that exhibits an observer-regulator separation and uses finite-dimensional LTI and FIR components, enabling scalable, low-memory deployments. A thorough cost characterization is provided, decomposing the impact of centralization, decentralization, and delays, and synchronization demonstrations illustrate practical performance trends across network topologies. The framework offers exact solutions, interpretability via an observer-regulator structure, and practical guidelines for distributed deployment in multi-agent systems with known processing delays.

Abstract

In this article, we consider a cooperative control problem involving a heterogeneous network of dynamically decoupled continuous-time linear plants. The (output-feedback) controllers for each plant may communicate with each other according to a fixed and known transitively closed directed graph. Each transmission incurs a fixed and known time delay. We provide an explicit closed-form expression for the optimal decentralized controller and its associated cost under these communication constraints and standard linear quadratic Gaussian (LQG) assumptions for the plants and cost function. We find the exact solution without discretizing or otherwise approximating the delays. We also present an implementation of each sub-controller that is efficiently computable, and is composed of standard finite-dimensional linear time-invariant (LTI) and finite impulse response (FIR) components, and has an intuitive observer-regulator architecture reminiscent of the classical separation principle.

Optimal Control of Multi-Agent Systems with Processing Delays

TL;DR

-optimal decentralized controller, along with its cost, under

assumptions. The results include both delay-free and delayed cases, with an agent-level implementation that exhibits an observer-regulator separation and uses finite-dimensional LTI and FIR components, enabling scalable, low-memory deployments. A thorough cost characterization is provided, decomposing the impact of centralization, decentralization, and delays, and synchronization demonstrations illustrate practical performance trends across network topologies. The framework offers exact solutions, interpretability via an observer-regulator structure, and practical guidelines for distributed deployment in multi-agent systems with known processing delays.

Abstract

Paper Structure (31 sections, 9 theorems, 60 equations, 12 figures)

This paper contains 31 sections, 9 theorems, 60 equations, 12 figures.

Introduction
Literature review
Preliminaries
Transfer matrices.
Block indexing.
Delay
Problem statement
Riccati equations
Optimal Controller
Parameterization of stabilizing controllers
Optimal controller without delays
Optimal controller with delays
Agent-level controllers
Interpretation of optimal controller
Characterizing the cost
...and 16 more sections

Key Result

Lemma 3

Consider the structured optimal control problem described in sec:problem_statementwith $\mathcal{P}$ given by eq:fourblock and suppose Ass:System holds. Pick $F_d$ and $L_d$ block-diagonal such that $A+B_2 F_d$ and $A+L_d C_2$ are Hurwitz. The following are equivalent:

Figures (12)

Figure 1: Directed graph representing five interconnected systems.
Figure 2: The loop-shifting approach mirkin2009loopmirkin2012h2mirkin2011dead transforms a loop with adobe input delay (top left) into a modified system involving a rational plant $\tilde{\mathcal{P}}$ and FIR blocks $\Pi_u$ and $\Pi_b$ (right). This transformation $\Gamma$ is defined in \ref{['sec:appendix_gamma']}. We can recover $\mathcal{K}$ from $\tilde{\mathcal{K}}$ via the inverse transformation (bottom left).
Figure 3: Agent-level implementation of all structured stabilizing controllers, parameterized by $\hat{\mathcal{Q}}\in\mathcal{H}_\infty\cap\mathcal{S}_0$. Here, $F_d$ is any block-diagonal matrix such that $A_{ii} + B_{2_{ii}} F_{d_{ii}}$ is Hurwitz. The $\mathcal{H}_2$-optimal controller is achieved when $\hat{\mathcal{Q}}=0$, and results in the simplified diagram of \ref{['fig:agent-level-opt']}. The blocks that depend on the processing delay $\tau$ are colored in green. All symbols are defined in \ref{['thm:6']}.
Figure 4: Agent-level implementation of the $\mathcal{H}_2$-optimal controller with processing delays. This is the result of setting $\hat{\mathcal{Q}}=0$ in \ref{['fig:agent-level-subopt']}. The blocks that depend on the processing delay $\tau$ are colored in green. All symbols are defined in \ref{['thm:6']}.
Figure 5: Four-agent chain graph with standard broadcast (top) and efficient immediate-neighbor implementation (bottom), which is possible because this graph is a multitree.
...and 7 more figures

Theorems & Definitions (22)

Definition 2: Riccati assumptions
Lemma 3
Remark 4
Remark 5
Remark 6
Remark 7
Theorem 8
Remark 9
Remark 10
Remark 11
...and 12 more

Optimal Control of Multi-Agent Systems with Processing Delays

TL;DR

Abstract

Optimal Control of Multi-Agent Systems with Processing Delays

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (12)

Theorems & Definitions (22)