Endangered Alert: A Field-Validated Self-Training Scheme for Detecting and Protecting Threatened Wildlife on Roads and Roadsides
Kunming Li, Mao Shan, Stephany Berrio Perez, Katie Luo, Stewart Worrall
TL;DR
This work tackles the challenge of detecting rare wildlife on roads under data-scarce, resource-constrained conditions by introducing a cloud–edge self-training framework augmented with Label-Augmentation Non-Maximum Suppression (LA-NMS). Stage 1 generates synthetic data on the cloud from web images, using LA-NMS and vision-language models (OWL-ViT and SAM) to produce pseudo-labels for an initial edge detector. Stage 2 deploys the edge model, collects field data, and iteratively refines the detector by auto-labelling field data on the cloud and fine-tuning the edge model, enabling continuous adaptation to new environments and a thermal-domain extension. A five-month field deployment demonstrates improved detection accuracy, higher prediction confidence, and practical viability for real-time driver alerts, highlighting the approach’s potential to mitigate animal–vehicle collisions in remote, bandwidth-limited settings.
Abstract
Traffic accidents are a global safety concern, resulting in numerous fatalities each year. A considerable number of these deaths are caused by animal-vehicle collisions (AVCs), which not only endanger human lives but also present serious risks to animal populations. This paper presents an innovative self-training methodology aimed at detecting rare animals, such as the cassowary in Australia, whose survival is threatened by road accidents. The proposed method addresses critical real-world challenges, including acquiring and labelling sensor data for rare animal species in resource-limited environments. It achieves this by leveraging cloud and edge computing, and automatic data labelling to improve the detection performance of the field-deployed model iteratively. Our approach introduces Label-Augmentation Non-Maximum Suppression (LA-NMS), which incorporates a vision-language model (VLM) to enable automated data labelling. During a five-month deployment, we confirmed the method's robustness and effectiveness, resulting in improved object detection accuracy and increased prediction confidence. The source code is available: https://github.com/acfr/CassDetect
