PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition
Otto Brookes, Majid Mirmehdi, Colleen Stephens, Samuel Angedakin, Katherine Corogenes, Dervla Dowd, Paula Dieguez, Thurston C. Hicks, Sorrel Jones, Kevin Lee, Vera Leinert, Juan Lapuente, Maureen S. McCarthy, Amelia Meier, Mizuki Murai, Emmanuelle Normand, Virginie Vergnes, Erin G. Wessling, Roman M. Wittig, Kevin Langergraber, Nuria Maldonado, Xinyu Yang, Klaus Zuberbuhler, Christophe Boesch, Mimi Arandjelovic, Hjalmar Kuhl, Tilo Burghardt
TL;DR
PanAf20K introduces the largest open-access video collection of wild great apes, comprising over 7 million frames across ~20,000 camera-trap videos from 14 field sites, to support ape detection and behaviour recognition in natural habitats. The dataset couples PanAf20K (multi-label behaviour annotations) with PanAf500 (fine-grained, frame-level annotations including full-body location and intra-video IDs) to benchmark detection and multi-label recognition using diverse state-of-the-art architectures. Benchmark results reveal the value of in-domain pretraining and underscore tail-class challenges, with long-tail strategies improving rare behaviour recognition but leaving a substantial gap for less frequent actions. Overall, PanAf20K provides a scalable, ecologically valid platform for AI-enabled conservation analytics, enabling robust population assessment and behavioural studies in great apes.
Abstract
We present the PanAf20K dataset, the largest and most diverse open-access annotated video dataset of great apes in their natural environment. It comprises more than 7 million frames across ~20,000 camera trap videos of chimpanzees and gorillas collected at 14 field sites in tropical Africa as part of the Pan African Programme: The Cultured Chimpanzee. The footage is accompanied by a rich set of annotations and benchmarks making it suitable for training and testing a variety of challenging and ecologically important computer vision tasks including ape detection and behaviour recognition. Furthering AI analysis of camera trap information is critical given the International Union for Conservation of Nature now lists all species in the great ape family as either Endangered or Critically Endangered. We hope the dataset can form a solid basis for engagement of the AI community to improve performance, efficiency, and result interpretation in order to support assessments of great ape presence, abundance, distribution, and behaviour and thereby aid conservation efforts.
