chatter: a Python library for applying information theory and AI/ML models to animal communication
Mason Youngblood
TL;DR
The paper addresses the limitation of discrete categorization in animal communication analysis by introducing chatter, a Python library for continuous latent-space analysis of vocalizations using information theory and modern neural architectures to represent sequences as trajectories in high-dimensional latent space. It delivers an end-to-end workflow—from preprocessing and segmentation to latent-feature extraction and downstream metrics—enabling measurement of complexity, predictability, similarity, and novelty without unit labels. The approach is taxonomy-agnostic and validated across birds, bats, whales, and primates, integrating tools like BirdNET and PaCMAP visualizations to provide accessible, flexible analysis. Overall, chatter lowers the barrier to applying advanced continuous representations to animal vocal repertoires and offers a modular framework that complements discrete-analysis tools.
Abstract
The study of animal communication often involves categorizing units into types (e.g. syllables in songbirds, or notes in humpback whales). While this approach is useful in many cases, it necessarily flattens the complexity and nuance present in real communication systems. chatter is a new Python library for analyzing animal communication in continuous latent space using information theory and modern machine learning techniques. It is taxonomically agnostic, and has been tested with the vocalizations of birds, bats, whales, and primates. By leveraging a variety of different architectures, including variational autoencoders and vision transformers, chatter represents vocal sequences as trajectories in high-dimensional latent space, bypassing the need for manual or automatic categorization of units. The library provides an end-to-end workflow -- from preprocessing and segmentation to model training and feature extraction -- that enables researchers to quantify the complexity, predictability, similarity, and novelty of vocal sequences.
