Table of Contents
Fetching ...

Adapting the re-ID challenge for static sensors

Avirath Sundaresan, Jason R. Parham, Jonathan Crall, Rosemary Warungu, Timothy Muthami, Margaret Mwangi, Jackson Miliko, Jason Holmberg, Tanya Y. Berger-Wolf, Daniel Rubenstein, Charles V. Stewart, Sara Beery

TL;DR

The paper tackles the problem of scalable, accurate censusing of Grévy's zebras from both rally-style imagery and long-term camera-trap data, a setting where re-ID is challenging due to open-set identities and poor imaging. It introduces a semi-automatic pipeline that integrates detection, species/viewpoint classification, census annotations (CA/CA-R), and Local Clustering Analysis (LCA) with Hotspotter and VAMP to build consistent ID graphs while minimizing human input. Results on the GZCD dataset show population estimates within 4.6% of ground truth for one rally and within 0.5% for the other, with an automation rate of 99.1% for CA-R and substantial human-effort reductions. On a large camera-trap dataset, 685 CA-R across 173 individuals were identified with 93.9% automation, yielding robust spatial-temporal insights; the approach demonstrates scalable, cross-dataset census capabilities and sets the stage for future extensions to nocturnal data and temporal/spatial integration.

Abstract

In both 2016 and 2018, a census of the highly-endangered Grevy's zebra population was enabled by the Great Grevy's Rally (GGR), a citizen science event that produces population estimates via expert and algorithmic curation of volunteer-captured images. A complementary, scalable, and long-term Grevy's population monitoring approach involves deploying camera trap networks. However, in both scenarios, a substantial majority of zebra images are not usable for individual identification due to poor in-the-wild imaging conditions; camera trap images in particular present high rates of occlusion and high spatio-temporal similarity within image bursts. Our proposed filtering pipeline incorporates animal detection, species identification, viewpoint estimation, quality evaluation, and temporal subsampling to obtain individual crops suitable for re-ID, which are subsequently curated by the LCA decision management algorithm. Our method processed images taken during GGR-16 and GGR-18 in Meru County, Kenya, into 4,142 highly-comparable annotations, requiring only 120 contrastive human decisions to produce a population estimate within 4.6% of the ground-truth count. Our method also efficiently processed 8.9M unlabeled camera trap images from 70 cameras at the Mpala Research Centre in Laikipia County, Kenya over two years into 685 encounters of 173 individuals, requiring only 331 contrastive human decisions.

Adapting the re-ID challenge for static sensors

TL;DR

The paper tackles the problem of scalable, accurate censusing of Grévy's zebras from both rally-style imagery and long-term camera-trap data, a setting where re-ID is challenging due to open-set identities and poor imaging. It introduces a semi-automatic pipeline that integrates detection, species/viewpoint classification, census annotations (CA/CA-R), and Local Clustering Analysis (LCA) with Hotspotter and VAMP to build consistent ID graphs while minimizing human input. Results on the GZCD dataset show population estimates within 4.6% of ground truth for one rally and within 0.5% for the other, with an automation rate of 99.1% for CA-R and substantial human-effort reductions. On a large camera-trap dataset, 685 CA-R across 173 individuals were identified with 93.9% automation, yielding robust spatial-temporal insights; the approach demonstrates scalable, cross-dataset census capabilities and sets the stage for future extensions to nocturnal data and temporal/spatial integration.

Abstract

In both 2016 and 2018, a census of the highly-endangered Grevy's zebra population was enabled by the Great Grevy's Rally (GGR), a citizen science event that produces population estimates via expert and algorithmic curation of volunteer-captured images. A complementary, scalable, and long-term Grevy's population monitoring approach involves deploying camera trap networks. However, in both scenarios, a substantial majority of zebra images are not usable for individual identification due to poor in-the-wild imaging conditions; camera trap images in particular present high rates of occlusion and high spatio-temporal similarity within image bursts. Our proposed filtering pipeline incorporates animal detection, species identification, viewpoint estimation, quality evaluation, and temporal subsampling to obtain individual crops suitable for re-ID, which are subsequently curated by the LCA decision management algorithm. Our method processed images taken during GGR-16 and GGR-18 in Meru County, Kenya, into 4,142 highly-comparable annotations, requiring only 120 contrastive human decisions to produce a population estimate within 4.6% of the ground-truth count. Our method also efficiently processed 8.9M unlabeled camera trap images from 70 cameras at the Mpala Research Centre in Laikipia County, Kenya over two years into 685 encounters of 173 individuals, requiring only 331 contrastive human decisions.

Paper Structure

This paper contains 13 sections, 11 figures, 2 tables.

Figures (11)

  • Figure 1: Map of several encounters of a single individual within Mpala Research Centre identified from camera trap data by our method and requiring minimal human labeling.
  • Figure 2: (Left) Location of our camera traps across the Mpala Research Centre. These cameras are placed according to four distinct strategies: random grid, randomly along roads, at known Grévy's territories along roads, and at magnets such as salt licks and watering holes. (Right) Map of all Grévy's encounters across Mpala.
  • Figure 3: Map of image GPS locations in the GZCD dataset. Meru County, Kenya (in red) is located north of the capital (star) and is at the base of Mt. Kenya. Includes all images by photographers that took images in Meru County, even if they were not taken in that county.
  • Figure 4: Example images of Census Annotations for Grévy’s zebra. Annotations on the left (boxed in red) were marked as non-CAs by a reviewer, and annotations on the right were marked as CAs.
  • Figure 5: Example of a good and poor quality CA of an individual within an encounter. (Left) Annotation with CA score of 0.9997; the hip and shoulder chevrons are very clearly visible. (Right) Annotation with CA score of 0.0032; viewpoint is correct but the hip and shoulder chevrons are not clearly visible.
  • ...and 6 more figures