Table of Contents
Fetching ...

Drone Detection and Tracking with YOLO and a Rule-based Method

Purbaditya Bhattacharya, Patrick Nowak

TL;DR

This study addresses the need for automated drone detection in public spaces to protect privacy and safety by extending a multi-sensor dataset with annotated color images and additional infrared data. It trains YOLOv7-based detectors (initialized from COCO) on infrared and color subsets and investigates lightweight attention modules to boost accuracy, complemented by a simple rule-based cross-frame tracker to reduce detection drops. The results show strong single-image performance with reasonable inference times, and the rule-based tracker yields additional detections, improving robustness in video. The work demonstrates a practical drone monitoring pipeline that leverages multisensor data and a minimal tracking extension, with potential for live deployment after further model comparisons and multi-sensor calibration.

Abstract

Drones or unmanned aerial vehicles are traditionally used for military missions, warfare, and espionage. However, the usage of drones has significantly increased due to multiple industrial applications involving security and inspection, transportation, research purposes, and recreational drone flying. Such an increased volume of drone activity in public spaces requires regulatory actions for purposes of privacy protection and safety. Hence, detection of illegal drone activities such as boundary encroachment becomes a necessity. Such detection tasks are usually automated and performed by deep learning models which are trained on annotated image datasets. This paper builds on a previous work and extends an already published open source dataset. A description and analysis of the entire dataset is provided. The dataset is used to train the YOLOv7 deep learning model and some of its minor variants and the results are provided. Since the detection models are based on a single image input, a simple cross-correlation based tracker is used to reduce detection drops and improve tracking performance in videos. Finally, the entire drone detection system is summarized.

Drone Detection and Tracking with YOLO and a Rule-based Method

TL;DR

This study addresses the need for automated drone detection in public spaces to protect privacy and safety by extending a multi-sensor dataset with annotated color images and additional infrared data. It trains YOLOv7-based detectors (initialized from COCO) on infrared and color subsets and investigates lightweight attention modules to boost accuracy, complemented by a simple rule-based cross-frame tracker to reduce detection drops. The results show strong single-image performance with reasonable inference times, and the rule-based tracker yields additional detections, improving robustness in video. The work demonstrates a practical drone monitoring pipeline that leverages multisensor data and a minimal tracking extension, with potential for live deployment after further model comparisons and multi-sensor calibration.

Abstract

Drones or unmanned aerial vehicles are traditionally used for military missions, warfare, and espionage. However, the usage of drones has significantly increased due to multiple industrial applications involving security and inspection, transportation, research purposes, and recreational drone flying. Such an increased volume of drone activity in public spaces requires regulatory actions for purposes of privacy protection and safety. Hence, detection of illegal drone activities such as boundary encroachment becomes a necessity. Such detection tasks are usually automated and performed by deep learning models which are trained on annotated image datasets. This paper builds on a previous work and extends an already published open source dataset. A description and analysis of the entire dataset is provided. The dataset is used to train the YOLOv7 deep learning model and some of its minor variants and the results are provided. Since the detection models are based on a single image input, a simple cross-correlation based tracker is used to reduce detection drops and improve tracking performance in videos. Finally, the entire drone detection system is summarized.

Paper Structure

This paper contains 10 sections, 10 figures, 4 tables.

Figures (10)

  • Figure 1: Overview of the drone detection system.
  • Figure 2: Comparison of the image resolution of all cameras: FLIR Scion OTM366 (top left - $640 \times 480$ pixels), InfraTec VarioCAM HD Z (top right - $1024 \times 768$ pixels), and Sony $\alpha$6000 (bottom - $1920 \times 1080$ pixels)
  • Figure 3: Used cameras: FLIR Scion OTM366 (left), InfraTec VarioCAM HD Z (center), and Sony $\alpha$6000 (right).
  • Figure 4: Exemplary images of the dataset recorded with FLIR Scion OTM366 at (\ref{['fig:FLIRHafen']}) harbour, with InfraTec VarioCAM HD Z at (\ref{['fig:InfraTecHSU']}) campus, with Sony $\alpha$6000 at (\ref{['fig:SonyHafen1']}) harbour and (\ref{['fig:SonyHSU1']}) campus, and with FLIR Scion OTM366 and Sony $\alpha$6000 at (\ref{['fig:SonyFLIRHafen1']}, \ref{['fig:SonyFLIRHafen2']}) harbour and combined.
  • Figure 5: Annotation heatmap of the drones recorded by the Infrared cameras.
  • ...and 5 more figures