A Distance-based Anomaly Detection Framework for Deep Reinforcement Learning

Hongming Zhang; Ke Sun; Bo Xu; Linglong Kong; Martin Müller

A Distance-based Anomaly Detection Framework for Deep Reinforcement Learning

Hongming Zhang, Ke Sun, Bo Xu, Linglong Kong, Martin Müller

TL;DR

A novel Mahalanobis distance-based (MD) anomaly detection framework, called MDX, for deep RL algorithms that simultaneously addresses random, adversarial, and out-of-distribution state outliers in both offline and online settings and is extended to robust and distribution-free versions by incorporating Robust MD and conformal inference techniques.

Abstract

In deep reinforcement learning (RL) systems, abnormal states pose significant risks by potentially triggering unpredictable behaviors and unsafe actions, thus impeding the deployment of RL systems in real-world scenarios. It is crucial for reliable decision-making systems to have the capability to cast an alert whenever they encounter unfamiliar observations that they are not equipped to handle. In this paper, we propose a novel Mahalanobis distance-based (MD) anomaly detection framework, called \textit{MDX}, for deep RL algorithms. MDX simultaneously addresses random, adversarial, and out-of-distribution (OOD) state outliers in both offline and online settings. It utilizes Mahalanobis distance within class-conditional distributions for each action and operates within a statistical hypothesis testing framework under the Gaussian assumption. We further extend it to robust and distribution-free versions by incorporating Robust MD and conformal inference techniques. Through extensive experiments on classical control environments, Atari games, and autonomous driving scenarios, we demonstrate the effectiveness of our MD-based detection framework. MDX offers a simple, unified, and practical anomaly detection tool for enhancing the safety and reliability of RL systems in real-world applications.

A Distance-based Anomaly Detection Framework for Deep Reinforcement Learning

TL;DR

Abstract

A Distance-based Anomaly Detection Framework for Deep Reinforcement Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (23)

Theorems & Definitions (2)