Domain-invariant Progressive Knowledge Distillation for UAV-based Object Detection
Liang Yao, Fan Liu, Chuanyi Zhang, Zhiquan Ou, Ting Wu
TL;DR
This work tackles UAV-based object detection under tight compute constraints by addressing two core issues in knowledge distillation: the large scale gap between teacher and student models and domain variations from complex UAV scenes. It introduces a progressive distillation framework with a senior and a junior teacher to gradually transfer knowledge, coupled with a domain-invariant transfer using Fast Fourier Transform to separate amplitude (domain-dependent) and phase (domain-invariant) information. The proposed approach achieves state-of-the-art results on VisDrone and SynDrone, outperforming prior KD methods and even surpassing the teacher in some metrics, while ablations highlight the additive gains from both components. The findings suggest that combining progressive teaching with FFT-based domain-invariant transfer can robustly improve lightweight detectors for UAV-OD in diverse environments.
Abstract
Knowledge distillation (KD) is an effective method for compressing models in object detection tasks. Due to limited computational capability, UAV-based object detection (UAV-OD) widely adopt the KD technique to obtain lightweight detectors. Existing methods often overlook the significant differences in feature space caused by the large gap in scale between the teacher and student models. This limitation hampers the efficiency of knowledge transfer during the distillation process. Furthermore, the complex backgrounds in UAV images make it challenging for the student model to efficiently learn the object features. In this paper, we propose a novel knowledge distillation framework for UAV-OD. Specifically, a progressive distillation approach is designed to alleviate the feature gap between teacher and student models. Then a new feature alignment method is provided to extract object-related features for enhancing student model's knowledge reception efficiency. Finally, extensive experiments are conducted to validate the effectiveness of our proposed approach. The results demonstrate that our proposed method achieves state-of-the-art (SoTA) performance in two UAV-OD datasets.
