YOLOMG: Vision-based Drone-to-Drone Detection with Appearance and Pixel-Level Motion Fusion

Hanqing Guo; Xiuxiu Lin; Shiyu Zhao

TL;DR

This work tackles the detection of extremely small drones in complex scenes with substantial ego-motion. It introduces YOLOMG, a motion-guided detector that fuses a pixel-level motion difference map with RGB appearance through a bimodal adaptive fusion module and passes the fused features to a lightweight YOLOv5-based backbone. The authors evaluate the approach on the ARD100 and NPS-Drones datasets, reporting higher AP than the compared detectors and robust generalization, including under low-light conditions. The study targets real-time, reliable drone detection in aerial applications and contributes ARD100 as a new, challenging benchmark for future research.

Abstract

Vision-based drone-to-drone detection has attracted increasing attention due to its importance in numerous tasks such as vision-based swarming, aerial see-and-avoid, and malicious drone detection. However, existing methods often fail when the background is complex or the target is tiny. This paper proposes a novel end-to-end framework that uses motion guidance to accurately identify small drones in complex environments. It starts by creating a motion difference map to capture the motion characteristics of tiny drones. Next, this motion difference map is combined with an RGB image using a bimodal fusion module, allowing for adaptive feature learning of the drone. Finally, the fused feature map is processed through an enhanced backbone and detection head based on the YOLOv5 framework to achieve accurate detection results. To validate our method, we introduce a new dataset, named ARD100, which comprises 100 videos (202,467 frames) covering various challenging conditions and has the smallest average object size among existing drone detection datasets. Extensive experiments on the ARD100 and NPS-Drones datasets show that our proposed detector performs well under challenging conditions and surpasses state-of-the-art algorithms across various metrics. We publicly release the code and the ARD100 dataset at https://github.com/Irisky123/YOLOMG.
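
As a rough illustration of what a pixel-level motion difference map can look like, the sketch below takes the per-pixel absolute difference of two grayscale frames. It is not the authors' implementation: the function name `motion_difference_map`, the `threshold` parameter, and the nested-list frame format are all hypothetical, and the frames are assumed to be already aligned, so camera ego-motion compensation is outside the sketch. The fusion module and the YOLOv5-based detector are not modelled either.

```python
def motion_difference_map(prev_frame, curr_frame, threshold=0):
    """Per-pixel absolute difference of two aligned grayscale frames.

    Frames are lists of rows of integer intensities (hypothetical format).
    Differences at or below `threshold` are set to 0 to suppress small noise.
    """
    if len(prev_frame) != len(curr_frame):
        raise ValueError("frames must have the same number of rows")

    diff_map = []
    for prev_row, curr_row in zip(prev_frame, curr_frame):
        if len(prev_row) != len(curr_row):
            raise ValueError("frames must have the same row length")
        row = []
        for prev_px, curr_px in zip(prev_row, curr_row):
            delta = abs(curr_px - prev_px)
            row.append(delta if delta > threshold else 0)
        diff_map.append(row)
    return diff_map


# Toy 2x3 frames: one pixel changes strongly, one changes slightly.
prev = [[10, 10, 10],
        [10, 10, 10]]
curr = [[10, 200, 10],
        [10, 10, 12]]

raw_map = motion_difference_map(prev, curr)
filtered_map = motion_difference_map(prev, curr, threshold=5)
```

In this toy case the strongly changed pixel survives thresholding while the slight change is zeroed, which is the intuition behind using motion as a cue for targets that are too small to recognize from appearance alone.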
