Image-Based Leopard Seal Recognition: Approaches and Challenges in Current Automated Systems
Jorge Yero Salazar, Pablo Rivas, Renato Borras-Chavez, Sarah Kienle
TL;DR
This work addresses the problem of recognizing leopard seals from conventional wildlife photography in harsh, remote habitats where data collection is labor-intensive. It surveys detection and segmentation methods, contrasting manual landmark approaches with automated detectors and analyzes the potential of Vision Transformers and the Segment Anything Model for marine wildlife imaging. The paper synthesizes SOTA methods for conventional imagery of pinnipeds, discusses key challenges, and highlights ViT and SAM as promising avenues for robust, scalable seal recognition. Ultimately, it offers guidance for developing non-intrusive, efficient monitoring tools that can aid conservation and ecosystem health assessment in Antarctic environments.
Abstract
This paper examines the challenges and advancements in recognizing seals within their natural habitats using conventional photography, underscored by the emergence of machine learning technologies. We used the leopard seal, \emph{Hydrurga leptonyx}, a key species within Antarctic ecosystems, to review the different available methods found. As apex predators, Leopard seals are characterized by their significant ecological role and elusive nature so studying them is crucial to understand the health of their ecosystem. Traditional methods of monitoring seal species are often constrained by the labor-intensive and time-consuming processes required for collecting data, compounded by the limited insights these methods provide. The advent of machine learning, particularly through the application of vision transformers, heralds a new era of efficiency and precision in species monitoring. By leveraging state-of-the-art approaches in detection, segmentation, and recognition within digital imaging, this paper presents a synthesis of the current landscape, highlighting both the cutting-edge methodologies and the predominant challenges faced in accurately identifying seals through photographic data.
