Streamlined Hybrid Annotation Framework using Scalable Codestream for Bandwidth-Restricted UAV Object Detection
Karim El Khoury, Tiffanie Godelaine, Simon Delvaux, Sebastien Lugan, Benoit Macq
TL;DR
Emergency UAV workflows are constrained by limited bandwidth, which delays critical decisions. The paper introduces a bandwidth-aware streamlined hybrid annotation framework that leverages a JPEG 2000 scalable codestream to run a fine-tuned DL detector on low-resolution data and selectively fetch high-resolution tiles for human annotation in regions of interest, achieving up to a $34$-fold reduction in decision time. Across nine emergency scenarios and varying data rates, the approach maintains competitive recall while dramatically accelerating response, illustrating the practicality of on-board, multi-resolution transmission and selective high-detail annotation. This work enables faster, bandwidth-adaptive UAV object detection and points to future enhancements via few-shot and active learning for continued performance gain in dynamic emergency contexts.
Abstract
Emergency response missions depend on the fast relay of visual information, a task to which unmanned aerial vehicles are well adapted. However, the effective use of unmanned aerial vehicles is often compromised by bandwidth limitations that impede fast data transmission, thereby delaying the quick decision-making necessary in emergency situations. To address these challenges, this paper presents a streamlined hybrid annotation framework that utilizes the JPEG 2000 compression algorithm to facilitate object detection under limited bandwidth. The proposed framework employs a fine-tuned deep learning network for initial image annotation at lower resolutions and uses JPEG 2000's scalable codestream to selectively enhance the image resolution in critical areas that require human expert annotation. We show that our proposed hybrid framework reduces the response time by a factor of 34 in emergency situations compared to a baseline approach.
