Animal Identification with Independent Foreground and Background Modeling
Lukas Picek, Lukas Neumann, Jiri Matas
TL;DR
This work tackles wildlife re-identification under domain shifts by explicitly modeling both foreground appearance and background context. It introduces Per-Instance Temperature Scaling (PITS) to calibrate per-sample predictions and fuses foreground likelihood $\Phi(x_{FG})$ with background priors $\Psi_k(x_{BG},x_{\theta})$ through $p(k|x_{FG},x_{BG},x_{\theta})$, achieving calibrated, robust identification. Background priors based on home, migrating, and time-decay locations are validated on two long-term wildlife datasets (Eurasian lynx and Caretta caretta), with substantial gains, especially for new locations. Ablation studies show calibration improvements and cross-architecture gains, though performance remains dependent on FG/BG segmentation quality and task-specific priors.
Abstract
We propose a method that robustly exploits background and foreground in visual identification of individual animals. Experiments show that their automatic separation, made easy with methods like Segment Anything, together with independent foreground and background-related modeling, improves results. The two predictions are combined in a principled way, thanks to novel Per-Instance Temperature Scaling that helps the classifier to deal with appearance ambiguities in training and to produce calibrated outputs in the inference phase. For identity prediction from the background, we propose novel spatial and temporal models. On two problems, the relative error w.r.t. the baseline was reduced by 22.3% and 8.8%, respectively. For cases where objects appear in new locations, an example of background drift, accuracy doubles.
