Resilient Learning-Based Control Under Denial-of-Service Attacks

Sayan Chakraborty; Weinan Gao; Kyriakos G. Vamvoudakis; Zhong-Ping Jiang

Resilient Learning-Based Control Under Denial-of-Service Attacks

Sayan Chakraborty, Weinan Gao, Kyriakos G. Vamvoudakis, Zhong-Ping Jiang

TL;DR

The paper addresses robust, data-driven output regulation for discrete-time linear systems with unknown parameters subject to Denial-of-Service (DoS) attacks. It introduces a resilient online policy-iteration method that learns the optimal controller from input-state data while an internal-model component preserves stability under DoS, and derives an explicit bound on DoS duration $T^ op$ to guarantee asymptotic tracking. The approach is validated on an inverted pendulum on a cart, illustrating that the learned controller maintains reference tracking despite intermittent communication outages. This work advances cyber-physical resiliency by combining policy iteration with DoS-aware stability analysis in a data-driven, model-free setting.

Abstract

In this paper, we have proposed a resilient reinforcement learning method for discrete-time linear systems with unknown parameters, under denial-of-service (DoS) attacks. The proposed method is based on policy iteration that learns the optimal controller from input-state data amidst DoS attacks. We achieve an upper bound for the DoS duration to ensure closed-loop stability. The resilience of the closed-loop system, when subjected to DoS attacks with the learned controller and an internal model, has been thoroughly examined. The effectiveness of the proposed methodology is demonstrated on an inverted pendulum on a cart.

Resilient Learning-Based Control Under Denial-of-Service Attacks

TL;DR

to guarantee asymptotic tracking. The approach is validated on an inverted pendulum on a cart, illustrating that the learned controller maintains reference tracking despite intermittent communication outages. This work advances cyber-physical resiliency by combining policy iteration with DoS-aware stability analysis in a data-driven, model-free setting.

Abstract

Paper Structure (6 sections, 3 theorems, 43 equations, 3 figures, 1 table, 1 algorithm)

This paper contains 6 sections, 3 theorems, 43 equations, 3 figures, 1 table, 1 algorithm.

INTRODUCTION
Preliminaries and problem formulation
Resilience analysis under DoS Attacks
Learning-based design under DoS Attacks
Simulation Results and Discussion
Conclusion and Future Works

Key Result

Lemma II.1

Under Assumptions assum:1-assum:2, if there exists a state-feedback controller such that the closed-loop system matrix of the augmented system eq:augSys1 is Schur. Then, the controller eq:feedContr solves the output regulation problem.

Figures (3)

Figure 1: Tracking and disturbance rejection under DoS attacks.
Figure 2: Convergence of $K_{j}$ to $K^\star$.
Figure 3: Convergence of $P_{j}$ to $P^\star$.

Theorems & Definitions (10)

Definition II.1
Remark II.1
Lemma II.1
proof
Theorem III.1
proof
Lemma III.1
proof
Remark IV.1
Remark IV.2

Resilient Learning-Based Control Under Denial-of-Service Attacks

TL;DR

Abstract

Resilient Learning-Based Control Under Denial-of-Service Attacks

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (10)