A machine learning pipeline for automated insect monitoring
Aditya Jain, Fagner Cunha, Michael Bunsen, Léonard Pasi, Anna Viklund, Maxim Larrivée, David Rolnick
TL;DR
The paper tackles scalable insect monitoring amid widespread insect decline by presenting a complete, open-source pipeline for automated moth monitoring from camera traps. The pipeline comprises detection, moth/non-moth classification, fine-grained moth species identification, and tracking, with zero-shot GBIF/iNaturalist data and synthetic-data training to reduce manual labeling. It reports region-specific validation, data augmentation strategies, and long-tail handling, and provides an AMI Data Companion and Web Platform to facilitate adoption by ecologists with varying ML expertise. The work is deployed across multiple continents and aims to enable massively scalable insect data collection to inform land use decisions and climate adaptation policies.
Abstract
Climate change and other anthropogenic factors have led to a catastrophic decline in insects, endangering both biodiversity and the ecosystem services on which human society depends. Data on insect abundance, however, remains woefully inadequate. Camera traps, conventionally used for monitoring terrestrial vertebrates, are now being modified for insects, especially moths. We describe a complete, open-source machine learning-based software pipeline for automated monitoring of moths via camera traps, including object detection, moth/non-moth classification, fine-grained identification of moth species, and tracking individuals. We believe that our tools, which are already in use across three continents, represent the future of massively scalable data collection in entomology.
