DEAL-YOLO: Drone-based Efficient Animal Localization using YOLO
Aditya Prashant Naidu, Hem Gosalia, Ishaan Gakhar, Shaurya Singh Rathore, Krish Didwania, Ujjwal Verma
TL;DR
This work tackles the problem of small-object wildlife detection in UAV imagery by enhancing a YOLOv8-based detector with center-focused and smoothness losses (Wise IoU and Normalized Wasserstein Distance), efficient feature extraction via Linear Deformable Convolutions, and a multi-scale fusion module (Scaled Sequence Feature Fusion). A two-stage confidence-guided ROI refinement further improves localization for low-confidence detections. Empirical results on BuckTales and WAID show substantial parameter reductions (up to ~87% fewer parameters in some cases) while maintaining or surpassing state-of-the-art accuracy, demonstrating strong potential for real-world wildlife monitoring and conservation tasks.
Abstract
Although advances in deep learning and aerial surveillance technology are improving wildlife conservation efforts, complex and erratic environmental conditions still pose a problem, requiring innovative solutions for cost-effective small animal detection. This work introduces DEAL-YOLO, a novel approach that improves small object detection in Unmanned Aerial Vehicle (UAV) images by using multi-objective loss functions like Wise IoU (WIoU) and Normalized Wasserstein Distance (NWD), which prioritize pixels near the centre of the bounding box, ensuring smoother localization and reducing abrupt deviations. Additionally, the model is optimized through efficient feature extraction with Linear Deformable (LD) convolutions, enhancing accuracy while maintaining computational efficiency. The Scaled Sequence Feature Fusion (SSFF) module enhances object detection by effectively capturing inter-scale relationships, improving feature representation, and boosting metrics through optimized multiscale fusion. Comparison with baseline models reveals high efficacy with up to 69.5\% fewer parameters compared to vanilla Yolov8-N, highlighting the robustness of the proposed modifications. Through this approach, our paper aims to facilitate the detection of endangered species, animal population analysis, habitat monitoring, biodiversity research, and various other applications that enrich wildlife conservation efforts. DEAL-YOLO employs a two-stage inference paradigm for object detection, refining selected regions to improve localization and confidence. This approach enhances performance, especially for small instances with low objectness scores.
