Multi-Species Object Detection in Drone Imagery for Population Monitoring of Endangered Animals
Sowmya Sankaran
TL;DR
The paper tackles the problem of counting endangered wildlife from drone imagery by fine-tuning the YOLOv8 object detector for multi-species aerial data. It employs extensive data collection (baseline and African safari datasets), large-scale hyperparameter optimization across 30 model variants, and data augmentation to achieve high accuracy, including a peak mAP-50 of 98.2% on elephants. Compared with the YOLOv8 baseline, the approach delivers up to 135x higher accuracy on safari datasets and enables real-time inference (approximately 40 fps) on the Jetson Orin Nano, demonstrating viable low-power deployment for field use. The work provides a cost-effective, edge AI solution for conservation monitoring, with public availability of datasets and logs to support reproducibility and broader deployment in wildlife refuges and by park rangers.
Abstract
Animal populations worldwide are rapidly declining, and a technology that can accurately count endangered species could be vital for monitoring population changes over several years. This research focused on fine-tuning object detection models for drone images to create accurate counts of animal species. Hundreds of images taken using a drone and large, openly available drone-image datasets were used to fine-tune machine learning models with the baseline YOLOv8 architecture. We trained 30 different models, with the largest having 43.7 million parameters and 365 layers, and used hyperparameter tuning and data augmentation techniques to improve accuracy. While the state-of-the-art YOLOv8 baseline had only 0.7% accuracy on a dataset of safari animals, our models had 95% accuracy on the same dataset. Finally, we deployed the models on the Jetson Orin Nano for demonstration of low-power real-time species detection for easy inference on drones.
