Automatic Detection and Annotation of Sperm Whale Codas
Guy Gubnitsky, Yaly Mevorach, Shane Gero, David F. Gruber, Roee Diamant
TL;DR
This work presents the first automatic detector and annotator for sperm whale codas, addressing the lack of scalable tools for automatic detection, annotation, and source separation of codas in passive acoustic data. It introduces a graph-based clustering framework that leverages the similarity of multi-pulse coda clicks, with structural and temporal likelihood analyses to distinguish codas from echolocation and to assign clicks to the same coda across overlapping sources. The approach enables large-scale coda characterization, revealing synchronized exchanges between whale pairs, discovering two new coda types, and enabling quantitative analyses of coda type distributions, inter-coda timing, and information transfer in codas. The method’s demonstrated robustness at low SNR and across near-field and far-field recordings supports real-time deployment for crewed and autonomous platforms, advancing both cetacean communication research and conservation management.
Abstract
A key technology in sperm whale (Physeter macrocephalus) monitoring is the identification of sperm whale communication signals, known as codas. In this paper we present the first automatic coda detector and annotator. The main innovation in our detector is graph-based clustering, which utilizes the expected similarity between the clicks that make up the coda. Results show detection and accurate annotation at low signal-to-noise ratios, separation between codas and echolocation clicks, and discrimination between codas from simultaneously emitting whales. Using this automatic annotator, insights into the characterization of sperm whale communication are presented. The results include new types of coda signals, analyzes of the distribution of coda types among different whales and for different years, and evidence for synchronization between communicating whales in terms of coda type and coda transmission time. These results indicate a high degree of complexity in the communication system of this cetacean species. To ensure traceability, we share the implementation code of our coda detector.
