Cooperative Search and Track of Rogue Drones using Multiagent Reinforcement Learning
Panayiota Valianti, Kleanthis Malialis, Panayiotis Kolios, Georgios Ellinas
TL;DR
The paper tackles the problem of locating and tracking rogue drones over sensitive facilities using a team of pursuer UAVs with limited sensing. It models joint search-and-track as a multiagent MDP and introduces a cooperative Q-learning framework with difference rewards to address credit assignment, aiming to maximize $F = \sum_{t=1}^{T} \sum_{i \in I} N^i_t$. Empirical results show that difference rewards yield faster learning and higher final performance than a global reward, especially as the number of agents grows, approaching near-optimal coverage when target trajectories are known. The work provides a scalable, model-free solution for coordinated search-and-track and outlines a path toward integrated search-track-jam interception and real-world field trials.
Abstract
This work considers the problem of intercepting rogue drones targeting sensitive critical infrastructure facilities. While current interception technologies focus mainly on the jamming/spoofing tasks, the challenges of effectively locating and tracking rogue drones have not received adequate attention. Solving this problem and integrating with recently proposed interception techniques will enable a holistic system that can reliably detect, track, and neutralize rogue drones. Specifically, this work considers a team of pursuer UAVs that can search, detect, and track multiple rogue drones over a sensitive facility. The joint search and track problem is addressed through a novel multiagent reinforcement learning scheme to optimize the agent mobility control actions that maximize the number of rogue drones detected and tracked. The performance of the proposed system is investigated under realistic settings through extensive simulation experiments with varying number of agents demonstrating both its performance and scalability.
